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ABSTRACT 

Context. Future dark energy space missions such as JDEM and EUCLID are being designed to survey the galaxy population to trace 
the geometry of the universe and the growth of structure, which both depend on the cosmological model. To reach the goal of high 
precision cosmology they need to evaluate the capabilities of different instrument designs based on realistic mock catalogs of the 
galaxy distribution. 

Aims. The aim of this paper is to construct realistic and flexible mock catalogs based on our knowledge of galaxy populations from 
current deep surveys. We explore two categories of mock catalogs : (i) based on luminosity functions that we fit to observations 
(GOODS, UDF,COSMOS,VVDS) (ii) based on the observed COSMOS galaxy distribution. 

Methods. The COSMOS mock catalog benefits from all the properties of the data-rich COSMOS survey and the highly accurate 
photometric redshift distribution based on 30-band photometry. Nevertheless this catalog is limited by the depth of the COSMOS 
survey. Thus, we also evaluate a mock galaxy catalog generated from luminosity functions using the Le Phare software. For these two 
catalogs, we have produced simulated number counts in several bands, color diagrams and redshift distributions for validation against 
real observational data. 

Results. Using these mock catalogs we derive some basic requirements to help design future Dark Energy missions in terms of the 
number of galaxies available for the weak-lensing analysis as a function of the PSF size and depth of the survey. We also compute 
the spectroscopic success rate for future spectroscopic redshift surveys ( i) aiming at measuring BAO in the case of the wide field 
spectroscopic redshift survey, and (ii) for the photometric redshift calibration survey which is required to achieve weak lensing 
tomography with great accuracy. In particular, we demonstrate that for the photometric redshift calibration, using only NIR (l-1.7yum) 
spectroscopy, we cannot achieve a complete spectroscopic survey down to the limit of the photometric survey (/ < 25.5). Extending 
the wavelength coverage of the spectroscopic survey to cover 0.6-1.7/iin will then improve the fraction of very secure spectroscopic 
redshifts to nearly 80% of the galaxies, making possible a very accurate photometric redshift calibration. 

Conclusions. We have produced two realistic mock galaxy catalogs that can be used in determining the best survey strategy for 
future dark-energy missions in terms of photometric redshift accuracy and spectroscopic redshift surveys yield. These catalogues are 
publicly accessible at http://lamwws.oamp.fr/cosmowiki/RealisticSpectroPhotCat or by request to the first author of this paper. 

Key words, redshift - JDEM - cosmology - surveys - mock catalog 



1. Introduction 

The prospect for high-precision cosmological inferences from 
large galaxy surveys has prompted the initiation of several 
projects with the goal of surveying thousands of square de- 
grees of sky in multiple filters with or without spectroscopy. 
The ground based projects [e.g. current KIDS, DES, and future 
Pan-STARRS, LSST in imaging and current SDSS-III/BOSS 
and future WFMOS BigBOSS in spectroscopy], and the space 
based missions [JDEM, and EUCLID, or their former and fu- 
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ture concepts] all propose to conduct wide field galaxy sur- 
vey (in imaging with or without spectroscopy) in order to ex- 
ploit the power of weak gravitational lensing [WL] and galaxy 
clustering (using baryonic acoustic oscillations [BAO] with 
or without redshift distortion measurements [RD]) to eluci- 
date the cause of the acceleration of th e Hubble expansio n 
dRiess et al.l(ll998h.lPerlmutter et al.l(ll999l) . lAstier et al.l d2006). 
iKilbinger et al.l d2009l) ). Proper design and forecasting of the 
performance of these experiments require an accurate estimate 
of the "yield" of galaxies, such as number counts, redshift sizes 
and color distributions, from a chosen survey configuration. It 
is typically straightforward to estimate the expected resolution 
and noise properties of the telescope and instrument, but more 
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difficult to quantify the number and properties of the "useful" 
galaxies available on the sky. A good forecast requires that we 
estimate the density of galaxies on the sky over the joint distri- 
bution of: 

(i) redshift; (ii) angular size, expressed as half-light radius ri/2; 
(iii) apparent magnitudes and colors in any chosen instrument 
passbands; (iv) emission-line strengths. 

The first three properties are essential to knowing whether a 
given galaxy will be detected at sufficient signal-to-noise (S IN) 
and resolution to determine its shape and its photometric red- 
shift. The last property, the emission-line strength, is essential 
to estimating the depth and completeness that any spectroscopic 
redshift survey will achieve. Spectroscopic redshift accuracy is 
needed to conduct the best possible BAO/RD measurements, and 
to calibrate photometric redshift ("photo-z") esti mators that are 
essent ial for WL tomographic measurement (e.g. iMassev et alj 
d2007h . 

In this paper we present two simulated catalogs of galaxy 
properties based on current deep surveys that we will use in 
forthcoming papers (Jouvel et al. 2009 in prep.) to forecast 
the performance of WL Dark Energy space based missions. 
Unfortunately we cannot simply use a catalog from some com- 
pleted survey, since (1) no single survey of useful depth has si- 
multaneously observed all of the listed properties for its target 
galaxies, and (2) the proposed surveys will exceed the depth, 
field of view, resolution, with or without wavelength coverage of 
most existing observed galaxy catalogs. For example, the space 
based spectroscopic survey will likely mainly be conducted in 
the near-infrared, surpassing any current infrared spectroscopic 
redshift survey. The mission concepts of SNAP, Destiny, and 
EUCLID propose near-infrared (NIR) imaging over very wide 
areas, but only a few squ are arcminutes of HST/NICMOS imag- 
ing data with the UDF dCoe et al.l l2006) and close to one degree 
down to K ~ 23 for grou nd based data with the UKIDSS survey 
on the UKIRT telescope dLawrence et al. 1 120071) are available to 
date at these magnitude levels, insufficient to serve as a robust 
source model. Any relevant simulated catalogs must therefore 
include some degree of extrapolation or modelling of the source 
population. 

In this paper, we explain how we have constructed 2 simu- 
lated galaxy catalog based on deep survey data. Section|2]explain 
the methodology used. In section [3] we validate these simulated 
catalogs by comparing their predicted magnitude, color, redshift, 
size, and emission-line strength distributions to real survey data 
taking into account the survey selection functions. 

Although this work was initiate d in the context of th e SNAP 
collaboration (see our first results in lDahlen et alj d2008l) ). it can 
easily be adapted to any instrument concept for a proper evalua- 
tion of its merit. When necessary we assume a ACDM universe: 
(£I m ,Qa) = (0.3,0.7) and H = 70 km/s/Mpc. All magnitudes 
used in this paper are on the AB system. 



2. Realistic mock Galaxy Catalog 

For both of our simulated catalogs, each galaxy is assigned a 
spectral energy distribution (SED), which can be integrated over 
any instrumental passband to forecast an apparent magnitude. 
Our first fully simulated catalog has been generated by using 
"Le Phare" simulation tool Q (Arnouts and Ilbert 2009 in prep) 
with an analytic lumi nosity function base d on the GOODS sur- 
vey from the work of Dahle n et alj d2005l) . We will refer to our 
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Fig. 1. Extended CWW library of SED templates. This repre- 
sents the full range of SED template linearly interpolated be- 
tween the 4 CWW templates and one star-forming template in 
AB magnitude (arbitrarily flux scaled at 4000A). 



derived simulated catalog as the GOODS Luminosity Function 
Catalog (GLFC). 

2.1. GLFC - GOODS Luminosity Function based Catalog 
2.1.1. SED Library 

The GLFC relies on two ingredients : 

- A set of SEDs spanning the entire range from Elliptical to 
starbusting galaxy, 

- A redshift evolving luminosity function (LF) per type. 

Those ingredients are the basic requirements to generate a cata- 
log that can predict global distributions such as magnitude, red- 
shift and color counts. We use the Coleman Extended library 
(CE) for our templa tes of galaxies. This lis t is based on the four 
observed spectra of Colema n et al.l d!980t) corresponding to an 
Elliptical, Sbc, Scd and Irregular, which have been extrapolated 
in the UV and IR wavelengths domain by using synthetic spectra 
from the GISSEL library (Chariot and Bruzual,1996). To repro- 
duce observed colors bluer than the CE templates, we add one 
spectrum of star-forming galaxies computed with the GISSEL 
model for solar metallicity, Salpeter IMF, constant star forma- 
tion rate and 0.05 G yr age. Following the approach adopted by 
ISawicki et alj d!996l) . we have linearly interpolated the original 
5 SEDs to provide a finer grid of spectral-type coverage with 
a total library of 66 templates shown in Figured] In Figure |2] 
we draw magnitudes of the 5 main templates in a magnitude- 
wavelength plane by bins of redshifts in order to show colors of 
these templates that may help in deciding which sensitivities are 
needed to detect galaxies inside different passbands. 
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Main SED : Ell, S b „, S ed , Irr, starburst 
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Fig. 2. Magnitudes of the 5 main templates composed by the 4 
CWW : Ell (red dotted), S be (gold dashed), S cc i (green long- 
dashed), Irr (cyan dot-dashed) and one star-forming (blue long 
dot-dashed) shown at some given redshift of 0.5,1,1.5,2,2.5,3. 
The templates are M* galaxies as given in Table[2] 

Table 1. Restframe colors of SED templates. 



LF 


E(B-V) 


types 


B — Wire st frame) 


1 





1 -> 17 


1.069^0.701 


2 





17 -> 55 


0.701 -> 0.2167 


3 





55 ^66 


0.2167 -> -0.0301 


3 


0.1 


56^66 


0.3252 ^0.1016 


3 


0.2 


56^66 


0.4592 -> 0.2345 


3 


0.3 


56 -> 66 


0.5945 -» 0.3686 


Main templates 


E(B-V) 


nbr 


B — V(rest frame) 


Ell 





1 


1.069 


S be 





21 


0.6262 


S cd 





36 


0.5077 


Irr 





51 


0.3185 


star — forming 





66 


-0.0301 



2.1.2. Extinction 

We have diversified our templates b y adding extinction com- 
puted from the Calzetti extinction law (Calz etti et al.l l2000) with 
the excess redenning values E(B - V) of 0,0.1,0.2,0.3 mag and 
using the extinction formula: 



fluX at tenuated(A) — fluXi ntr i ns i c {A).\Q 



-0.4*k(A).E(B-V) 



(1) 



The k(A) values follow the Calzetti law. Each E(B - V) value will 
be applied to the irregular and star-forming templates, increasing 
the number of templates for those types. Finally we also include 
the Lyman absorptio n by the I nterg alactic Medium as a function 
of redshift, following Madau d 1995b . 



2.1.3. GLFC Luminosity Function 

The GLFC is generated by assuming a luminosity function (LF) 
for galaxies of 3 different types, then drawing galaxies from 
these LFs to populate our simulated sky area. We have se- 
lected these 3 types following the B- V rest-frame colors given in 
iDahlen et all d2005l) (see tabled}- We start wit h the LF estimated 
in rest-frame B band by IDahlen et alj d2005l) in the 3 spectra l 
types inside 4 redshifts bins over < z < 1. IDahlen etaLld2005l) 
derive these LFs by fitting to photo-z data from the GOODS sur- 
vey. We have extrapolated these LFs to z = 6 to be complete 
for the galaxies likely to be used in future surveys. The Dahlen 
Z = 1 LF for each type is held nearly fixed for 1 < z < 6, with 
some adjustments made to improve the match to data from the 
COSMOS survey (described below). 

Table [2] gives the LF calculated for 3 galaxy types extrap- 
olated until redshift z — 6 as explained above. To produce the 
mock catalog, knowing the luminosity function, Le Phare de- 
rives a number of objects by magnitude and redshift bins (z, m) 
using a Schechter function (Schechter 1976) : 



n(M(z, m))dM = 



M(z, m) 
M* 



exp 



M(z, m) 
M* 



dM 
~M* 



(2) 



M is the absolute magnitude which is a function of redshift and 
apparent magnitude (z, m), M*,<p* and a are the parameters of 
the Schechter function given in the Table [2] 

According to the adopted area, the simulated galaxy counts 
in each magnitude-redshift-type bin are drawn from a Poisson 
distribution of the expected number based on the LF. 

Note that the luminosity function parameters depend on the 
cosmological model assumed (as the luminosity function is ex- 
pressed in physical units). However the resulting galaxy catalog 
can be considered to be independent of the input cosmological 
parameters assumed here, as it is built to reproduce the observed 
galaxy distribution and properties. 



Table 2. Luminosity Function in the B band extrapolated to 
match COSMOS colors and number counts used to create the 
GLFC catalog. 



LF 


z 


0* 


M* 


a 


1 


0.5 


16.7e-4 


-21.15 


-0.71 


1 


0.75 


18e-4 


-21.13 


-0.50 


1 


1.0 


9.5e-4 


-21.72 


-0.98 


1 


2.0 


8.5e-4 


-21.72 


-0.98 


1 


3.0 


7.5e-4 


-21.72 


-0.98 


1 


6.0 


6.5e-4 


-21.72 


-0.98 


2 


0.5 


24.0e-4 


-21.08 


-1.37 


2 


0.75 


25.0e-4 


-21.29 


-1.17 


2 


1.0 


25.0e-4 


-21.2 


-1.1 


2 


1.5 


24.4e-4 


-21.15 


-0.05 


2 


2.0 


24.4e-4 


-21.15 


-0.99 


2 


3.0 


24.4e-4 


-21.15 


-0.99 


2 


6.0 


24.4e-4 


-21.15 


-0.85 


3 


0.5 


48.6e-4 


-18.84 


-1.1 


3 


0.75 


33.1e-4 


-19.67 


-1.18 


3 


1.0 


33.1e-4 


-20.55 


-1.52 


3 


1.5 


30.0e-4 


-20.62 


-1.62 


3 


2.0 


25.0e-4 


-20.72 


-1.7 


3 


3.0 


25.0e-4 


-20.72 


-1.8 


3 


6.0 


25.0e-4 


-20.72 


-1.8 



Stephanie Jouvel et al.: Mock Galaxy Catalogs for DE 



4 

Drawing from the luminosity functions gives the galaxy dis- 
tribution over the joint magnitude-color-redshift space. We as- 
sign a half-light radius to each galaxy as follows: we first cal- 
culat e the galaxy's apparen t magnitude in the F814W HST fil- 
ter. iLeauthaud et al] d2007l) provides a size-magnitude catalog 
for the 1.64 deg 2 COSMOS survey executed in this filter. We 
assign the simulated galaxy a half-light radius that is drawn at 
random from all galaxies of the same F814W apparent magni- 
tude in the observed COSMOS catalog. Thus the size-magnitude 
distribution of the simulated catalog will, by construction, ex- 
actly match the COSMOS observations. The procedure does not 
reproduce any additional dependence of galaxy size upon type, 
color, or redshift using the COSMOS catalog. Any further corre- 
lation between size and other parameters such as color, redshift 
and galaxy type could in principle be implemented by following 
the COSMOS catalogue. As this is not needed in this paper, we 
have not implemented these higher order correlations. However 
the size-magnitude relation of the mock catalogue do follow the 
COSMOS relation. 

2.1.4. Photometric Noise 

A real survey has noise from the finite photon counts and de- 
tector noise. Estimation of this noise is of course essential for 
forecasting survey performance. It may also be important for 
validation of the simulation against real data, since the noise 
can induce biases on number counts. Le Phare produces both 
noiseless magnitudes and noisy magnitudes for each chosen ob- 
servation bandpass. The noisy magnitudes are randomly drawn 
from a Gaussian distribution with mean and standard deviation 
(m, err m ). We define a reference couple (m*,err*). At magnitude 
m < m*, the object noise is dominant and we assume that the 
magnitude error follows a power law of the adjustable form : 

err m = l (°-4(P+V(m-m')) (3) 

At magnitude m > m*, the sky noise is dominant and we assume 
the error on magnitude follows an exponential law : 

err m = err* /2.72.e*/>(10 (?(m_m * )) ) (4) 

(p, q) are the slopes for the power laws, both derived from the 
survey characteristics. 

We will, however, use noiseless magnitudes for the valida- 
tion tests in this paper, because the comparison surveys (e.g. the 
UDF) have high S /N in the regimes of comparison, and because 
the noise-induced biases are generally small. 

2.2. CMC - The COSMOS Mock Catalog 

The second simulated galaxy catalog is built directly from the 
obser ved COSMOS c atalog of Capak et al. (2009) in prep, 
and lllbert et all d2009t) . We will refer to this simulation as the 
COSMOS mock Catalog (CMC). 

2.2.1 . The COSMOS catalog 

The COSMOS photometric-redshift catalog (lllbert et al J l2009l) 
was computed with 30 bands over ~2-deg 2 taken from GALEX 
for UV bands, Subaru for the optical (U to z), and CFHT, UKIRT 
and Spitzer for the NIR bands. However, we restrict our mock 
catalog to the central square area of 1 .38 deg 2 which is fully cov- 
ered by HST/ ACS imaging. The COSMOS-ACS catalog gives 
592000 galaxies for an area of 1.38deg 2 . This is roughly a 
density of 120 galaxies per arcmin 2 down to i + < 26.5. In the 



COSMOS photometric-redshift catalog, 10% of this surface cor- 
responds to areas masked because of bright stars that prevent 
quality multiband photometry in the extended bright star halos. 
The effective area is thus in fact 1.24 deg 2 of unmasked re- 
gion with a total number of 538000 simulated galaxies out to 
i + < 26.5 leading again to roughly a density of 120 gal/arcmin 2 . 
Point sources such as stars and X ray sources (mostly dominated 
by an AGN) were also removed from the mock catalog. 

The photo-z accuracy is based on a comparison to spec- 
troscopic surveys like the zCOSMOS (bright survey down to 
Kned < ^2-5 with 4148 galaxies and faint down to i+, < 24 
with 148 galaxies) and also the MIPS infrared selec ted sample 
brigh t and faint with 317 galaxies, Figure 7 and 8 of lllbert et al] 
20091) ). The COSMOS photo-z accuracy is o- Az/(l+Zs) = 0.007 
at i + < 22.5 with a catastrophic rate below 1% (see Figure 6 of 
lllbert et al] d2009l) ). At fainter magnitudes and z < 1.25, the es- 
timated accuracy is cr Az = 0.02, 0.04, 0.07 at i + ~ 24, i + ~ 25, 
z' + ~ 25.5, respectively. The accuracy is degraded at i + > 25.5. 
The deep NIR and IRAC coverage enables the photo-z to be ex- 
tended to z ~ 2 albeit with a low er accuracy (1x^/(1+,^ = 0.06 at 
~ 24) (see lllbert et all d2009l ) for more details). 
Despite the lower photo-z accuracy at i + > 25.5 and z > 1 .25 
and the possible bias due to the faint AGN contribution, we use 
the full COSMOS catalog. Indeed, the photo-z accuracy is not 
crucial for the simulation. The COSMOS catalog is only used to 
obtain a representative population of galaxies in term of density 
and mix of galaxy types. Since the predicted apparent magni- 
tudes are calculated from the best-fit templates, the photo-z ac- 
curacy has no impact on our ability to link the simulated redshift 
to the predicted colors. The only risk of including lower qual- 
ity photo-z's is to degrade slightly the catalog representativity, 
possibly biasing the redshift distribution at z > 1.25. 

2.2.2. The mock catalog construction 

The principle of our simulation is to convert the observed prop- 
erties of each COSMOS galaxy into simulated properties that 
can then be viewed using any possible instrument configuration. 
A photo-z and a best-fit template (including possible additional 
extinction) are associated with each galaxy of the COSMOS cat- 
alog. 

The first step is to integrate the best-fit template (in the ob- 
server frame) through the instrument filter transmission curves 
to produce simulated magnitudes in the instrument filter set. 

The second step is to apply random errors to the simulated 
magnitudes, based on the magnitude-error relations established 
in each filter (see Sec tion l2. 1.4b . 

Importantly, all the COSMOS measured properties are prop- 
agated to the simulated galaxies, for instance, the galaxy half- 
light r adius as measured on the ACS images by ILeauthaud et all 
(120071) . 

This approach presents the following advantages: 

- the simulated mix of galaxy populations is, by construction, 
representative of a real galaxy survey, 

- additional quantities measured in COSMOS (like the galaxy 
size, UV luminosity, morphology, stellar masses, correlation 
in position) can be easily propagated to the simulated cata- 
log. 

The COSMOS mock catalog is limited to the range of magni- 
tude space where the CO SMOS imagi n g is co mplete (i^ B ~ 26.2 
for a 5<x detection, see ICapak etall (120071) and Capak et al. 
(2009) in prep. 
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2.2.3. Simulating emission lines 

For each galaxy of the COSMOS mock catalog we have asso- 
ciated emission line fluxes. This feature is useful to predict the 
size and the depth of a spectroscopic redshift sample. We mod- 
eled the emission line fluxes (Lya, [Oil], HB, [OIII] and Ha) of 
each galaxy as explained below. 

We used the m ethod described in Section 3.2 of lllbert et alj 
d2009l) . Using the iKennicuttl dl998l) calibration, we first esti- 
mated the star formation rate (SFR) from the dust-corrected 
UV rest-frame luminosity already measured for each COSMOS 
galaxy. The SFR can then be translated t o an [OH ] emis - 
sion line flux using another calibration from IKennicuttl d!998l) . 
We checked that the relation found between the [Oil] fluxes 
and the UV lumin osity is in good ag reement with the VVDS 
data (see Fig.3 of lllbert et alj ( 12009b ') and still valid for dif- 
ferent galaxy populations. For the other emission lines, we 
adopted intrinsic, unextincted flux ratios of [OIII]/[OII] = 
0.36; HMOII ] = 0- 2 8; Hq/rOII] = 1 . 77 an d Lyq/rOII] = 2 
(McC all et all d 19851). iMoustakas et al.1 d2006l) . iMouhcine etafl 
d2005l) . Kennicutt "dl998l) V The approach of determining the Lya 
line flux through its ratio with the Oil emission line flux is per- 
haps not good, but since the Lya line becomes visible to optical- 
NIR wavelength surveys only beyond z ~ 3, it will not have a big 
impact. Finally, we reduce each galaxy's line flux using the best- 
fit dust attenuation found with the template fitting procedure in 
the COSMOS photo-z catalog. 

The same procedure can be applied to the GLFC since we 
can calculate the UV absolute luminosity the same way as for 
the CMC. 

The COSMOS mock Catalog (CMC) has the advantage over 
the GLFC that it better preserves the relations between galaxy 
size and color (and presumably type and redshift). The CMC 
may also reproduce color distributions more accurately, since 
the population has not been reduced to three galaxy types as 
in the GLFC. The CMC is, however, limited by construction 
to the range of magnitude space where the COSMOS imaging 
is complete, whereas the GLFC can be extrapolated to fainter 
galaxies. Our validation tests below will compare the differ- 
ences of these two simulation approaches. It is important to 
stress that all the following galaxy number densities correspond 
to mask-corrected areas. Hence for real surveys, those numbers 
would have to be reduced by a factor of ss 10% as observed in 
COSMOS field. 



3. Validation 

The aim of these mock catalogs is to predict the performance 
of future surveys such as JDEM and EUCLID. The depth of the 
foreseen surveys may be significantly fainter than the deepest 
existing spectroscopic and NIR imaging wide field surveys, so 
a comprehensive observational validation of the catalog is not 
yet possible — especially in terms of NIR photometry and red- 
shift distribution. We should, however, prove that the GLFC and 
CMC are consistent with existing data. For this validation, we 
will compare the galaxy counts, color distributions, redshift dis- 
tributions, and emission-line distributions to a selection of the 
deepest available relevant survey data. The comparison data are 
taken from: 



- The HST Ultra-Deep Field (UDF) catalog dCoe et alj |2006) 
Q and included references covers 11.97 arcmin 2 in the 4 



HST/ACS filters F435W, F606W, F775W and F850LP, re- 
ferred to as B, V, i, and z bands, respectively, and 5.76 
arcmin 2 for the NICMOS filters Fl 10W and F160W, referred 
to as J and H bands. The UDF detects objects are at 10cr for 
Z < 28.43, i < 29.01 and J < 28.3, significantly fainter than 
the expected depth of any of the aforementioned proposed 
surveys. Due to the small area, the UDF is more sensitive to 
the cosmic variance, but it is useful for comparisons at faint 
magnitudes. 

- The GOODS (Great Observatories Origins Deep surveys) 
vl.l survey catalog Fl dGiavalisco et aT]|2004l) . The GOODS 
observations are split into northern and southern fields. Each 
field covers 160 arcmin 2 in the same BViz filters as the UDF, 
and is 90% complete at z ~ 26. 

- The GOODS-M USIC (MUltiwavelen gth Southern Infrared 
Catalog) sample (Grazia tTet alJ 1200 6) has been constructed 
with public data of the GOODS-S field, NIR Spitzer data 
from IRAC instrument (3.6, 4.5, 5.8 and 8.0 n m) and 
U-band data from the 2.2ESO and VLT-VIMOS covering 
140 arcmin 2 . This catalog is both z and Ks-selected and is 
90% complete for K s < 23.8 and z < 26. 

- The VVDS-DEEP first epoch public releas^3 includes pho- 
tometric data in BVRI V IRMOS-VLT bands over 0.49 deg 2 
dMcCracken et alj 20031): a spectro scopic survey of targets 
with I & ft < 24 dLe Fevre et al.ll2005h with a sampling rate of 
0.2 and a mean redshift of 0.86. 

- The V VDS-DEEP NIR J and K s photometry (llovino et al.1 
120051) covers 170 arcmin and is complete for Ks < 22.5. 
This sample contains the BVri VIRMOS-VLT band and 
the JKs ESO/NTT (New Techonology Telescope) using the 
SOFI Near Infrared imaging camera. 

A cut in the size-magnitude plane is applied to each dataset in or- 
der to remove stellar contamination, in case this has not already 
been done for published catalogs. 

For each of the above listed surveys, both a CMC and a GLFC 
are constructed as described above, projecting both mock cata- 
logs onto the real surveys' filter sets. 

3.1. Galaxy Counts 

We compare the simulated galaxy counts (dN/dm) to those in the 
real catalogs listed above. The SED of each simulated galaxy is 
integrated over the bandpass of each real catalog. 

In Figure|3]we compare the differential galaxy counts in the 
B, V, i, and z bands of the UDF and GOODS to those synthesized 
from the GLFC and CMC simulations. The blue solid line is the 
COSMOS mock catalog (CMC) and the black dot-dashed line 
is the Le Phare simulation based on the GOODS LF (GLFC). 
The real observations are: the UDF (red dotted); GOODS North 
(long-dashed green) and South (dashed magenta). The UDF 
counts in the NIR are also compared to the simulations. An ex- 
cellent agreement is seen between all sources for 21 < m < 26, 
where each is complete, with some tendency for the UDF counts 
to be lower than other surveys. We attribute this to cosmic vari- 
ance because of the very small UDF survey area. The difference 
between the simulated catalogs and the GOODS South is gener- 
ally less than the difference between GOODS North and South 
(w 15%). Note that the CMC becomes incomplete for m > 26 
(m > 25 in the 1.6 /mi band), so will underestimate the galaxy 
yield in surveys deeper than these limits. 



http://adcam.pha.jhu.edu/~coe/UDF/ 



http://archive.stsci.edu/prepds/goods/ 
4 http://cencos.oamp.fr/ 
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CMC(GOODS) 
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UDF 



VVDS -Wide Iovino 
VVDS-DEEP data 




20 25 20 

AB magnitude 

Fig. 3. Differential galaxy counts for F435W (B band), F775 W (i 
band), F850LP (z band), F160W (H band) and the Ks band com- 
paring mock catalogs to real observations of UDF and GOODS 
surveys. 



We also compare in Figure [3j the simulated Ks band counts 
to the GOODS-MU S IC an d VVDS data. The tf. s -band VVDS 
data of llovino etail d2005l) are 90% complete at I AB = 25.5. 
We see a very good agreement between the VVDS, GOODS- 
MUSIC and simulated catalogs at Ks < 20 and some discrep- 
ancies at fainter magnitudes. However, we measure a mean less 
than 17.9% difference between VVDS Iovino and CMC, 14.4% 
difference between VVDS Iovino and GLFC and 17.4% between 
GOODS-MUSIC and VVDS Iovino within the observation lim- 
its of these 2 surveys. We conclude that both simulated catalogs 
are well reproducing the Ks counts. 

3.2. Colors 

Figure|4]compares the B-z color distribution of the simulated 
catalogs to those of GOODS and UDF galaxies. The blue solid 
line is the COSMOS catalog (CMC) and the black dotted line is 
the GLFC catalog in GOODS filters. The real observations are: 
the UDF (dot-dashed red); GOODS North (long-dashed green) 
and South (dashed magenta). The indicated cut in S/N and mag- 
nitude are applied to F850LP (z Band) for each catalog. The 
magnitude cuts in the B and z bands are taken following the 
COSMOS completness in each of these bands. There is a good 
agreement in these optical wavelengths. The simulated catalogs 
have median and mean B-z colors in agreement at a few percent 
with the UDF and GOODS catalogs (see Table©. 

The UDF seems relatively deficient in the reddest galaxies, 
again perhaps a manifestation of cosmic variance, which is most 
severe for the highly-clustered red-sequence galaxies. This is an 
expected result and is not an issue with the catalogues since the 
UDF may be under-dense due to its small survey area. 

Figure |5] compares simulated B - Ks colors to those in the 
GOODS-MUSIC and VVDS-DEEP catalog. The blue solid line 
is the COSMOS mock catalog (CMC) and the black dotted line 




1 2 
color F435W-F8501p 

Fig. 4. Color histogram of F435W (B Band)-F8501p (z Band) 
comparing mock catalogs to real observation of UDF and 
GOODS surveys. 

Table 3. Mean and median of the B-z color distributions with 
the magnitude cuts of Figured] corresponding to the complete- 
ness of the CMC. 



Catalogs 


Mean 


Median 


survey area 


ma 


g limit 


CMC(GOODS) 


1.24 


1.14 


1.24 deg 2 


I 


-26 


GLFC(GOODS) 


1.2 


1.05 


0.1 deg 2 


R 


~ 31 


GOODS-S 


1.26 


1.1 


160 arcmin 2 


z 


-26 


GOODS-N 


1.3 


1.13 


160 arcmin 2 


z 


-26 


UDF 


1.16 


1.06 


1 1.97 arcmin 2 


z ~ 


28.43 



is the GLFC catalog. The observational data are the GOODS- 
MUSIC survey (top panel in dashed magenta) and VVDS-DEEP 
survey (bottom panel in dashed green). We choose to cut at 
22.5 AB mag in the Ks Band due to the VVDS incompletness 
and the variability of the GOODS-MUSIC survey area beyond 
this magnitude. The color distributions in NIR agree well inside 
the completeness limits imposed by the different surveys, the 
CMC z < 25.5 and B < 26.5 and the GOODS-MUSIC survey 
Ks < 22.5. The GOODS-MUSIC data have lower B-Ks counts 
which is explained by the number count difference at Ks > 20 
in the Ks-band between both simulated catalogs and the Grazian 
counts (see Figure[3j. However, the mean and median colors are 
in good agreement. The agreement is much better when compar- 
ing our mock catalogs to the VVDS-DEEP survey (bottom panel 
of Figure |5]l. 



3.3. Redshifts 

Comparison of the simulated redshift distributions to real data 
is likely less accurate than colors or magnitudes, since real red- 
shift surveys are much shallower and have significant incom- 
pleteness. Nonetheless, Figure |6] shows median and quartiles of 
the redshift distributions vs /-band magnitude of the CMC (blue 
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GOODS-MUSIC (S) 
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2 4 6 8 

color B-Ks (NTT SOFI) 

Fig. 5. Color histogram of F435W (B Band)-Ks comparing mock 
catalogs to real observation of GOODS and VVDS-DEEP sur- 
veys. 



medium-thickness line) and the GLFC catalogs (black thin line) 
in the VVDS filters compared to the VVDS-DEEP redshift dis- 
tribution (green high-thick line). The dotted lines represents the 
quartiles and solid lines the median of the redshift distribution. 

The redshift quartiles of the CMC agree very well with the 
VVDS spectroscopic redshift distribution to the 1-24 limit 
of the latter. The GLFC seems to have a lower median redshift, 
probably due to the GOODS LF used to produce the catalog. The 
mean difference of the median redshift is 0.02 and 0.07 for the 
CMC and GLFC catalogs, respectively. We conclude not sur- 
prisingly that the CMC is probably a better representation of 
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Fig. 6. Redshift distribution as a function of the VVDS i band 
magnitude compared to mock catalogs redshift distribution. 
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Fig. 7. Median of B-i color (VVDS bands) as a function of red- 
shift for GLFC and CMC mock catalogs and the VVDS-DEEP 
survey. 



the magnitude-redshift distribution, and is as accurate as current 
data can validate. 

Figure|7]plots median B-i color vs redshift for the simulated 
catalogs CMC (blue solid line) and GLFC (black dotted line) 
in the VVDS passbands compared to the VVDS-DEEP survey 
(green dashed line). All catalogs have been restricted to / < 24, 
the completeness domain of the VVDS-DEEP redshift survey. 
The colors agree to 0.1-0.2 mag until z ~ 2. Above this redshift 
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Fig. 8. Redshift distribution of the VVDS-DEEP survey, the 
COSMOS survey and the GLFC catalog from the GOODS lu- 
minosity function. 



the VVDS survey is likely to be highly incomplete, even for blue 
galaxies, as no strong emission lines are available in the VVDS 
spectral wavelength range and due to the / = 24 magnitude cut 
(see Figure |6]l for the median redshift and quartiles of the distri- 
bution. 

We compare the redshift distributions (Figure HJl for differ- 
ent magnitude cuts (different colors and thickness). There is a 
very good agreement beetween the VVDS-DEEP (dashed lines), 
the COSMOS survey (solid lines) and the GLFC (dotted lines) 
redshift distributions. 

The CMC and the VVDS-DEEP catalog show very good 
agreement. This figure shows that both mock catalogs are well 
reproducing the number density inside the VVDS volume : 
(0 < z < 1 .5 and / < 24) and agree at higher magnitude and 
r edshift. The GLFC is based on the GOODS luminosity function 
( dDahlen et al.ll2005l) ) which has equivalent depth to the VVDS 
galaxy sample. However when looking at Figure [6] the VVDS 
magnitude-redshift distribution seems to be in better agreement 
with the CMC than the GLFC. 



3.4. Emission-Line Strength 

Future dark energy surveys need large spectroscopic redshift 
samples for calibrating photometric redshifts, measuring accu- 
rately the BAO, and for other probes using spectroscopic sam- 
ples of galaxies. Thus it is crucial to have realistic emission lines 
allowing predictions of the success rate, depth, and size of spec- 
troscopic samples that we will need. Although, the CMC does 
not reproduce absorption lines, it simulates emission lines for 
all galaxies in the catalogue, which allows a first estimation of 
spectroscopic survey capabilities. The v alidation of t he em ission 
line strength can be found in Figure 3 of Ilb ertet al.1 d2009). This 



figure shows the relation betw een the Oil flu x and the rest-frame 
UV luminosity predicted by lKennicuttl (Il998h : 

log[OII] = -OAMuv + 10.57 - (5) 

It shows very g ood a greement between VVDS data from 
Lamareille et al. (2008) and the simulated emission lines 
strength extrapolated from the photometric redshift best fit 
template between 0.4 < z < 1.4 where the Oil emission line 
can be measured. Figure [9] shows th e UV-O II relation using 
the VVDS data from lLamareille et al.1 (120081) with 4 color cuts 
corresponding to the early types in red circles, intermediate 
types in orange triangle, late types in square green and starburst 
galaxies in blue stars. Criteria for the type selections follows 
the Dalhen prescription detailed in the Table Q] This shows that 
the correlation has no strong dependence on the galaxy types. 
The histograms shown in Figure [10] are the emission line ratios 




12 3 

-0.4xM(UV) - DM(z)/2.5 + 10.57 



Fig. 9. UV-OII correlation using 4 color cuts representing the 
early, inte rmediate, late and starb urst galaxies using the VVDS 
data from Lamareill e et al.l (120081) . 

of the CMC compared with those of VVDS. The attenuation 
causes a spread on the emission line ratios but we had to add 
a Gaussian dispersion to fit the VVDS emission line spread. 
If r„ is a set of n numbers randomly drawn from a Gaussian 
distribution (/z, o 2 ) = (0, 1), we define the new fluxes as : 



Ha = Ha(l + |r /4|) (6) 

H/5 =Hfi(l + |r () /2|) (7) 

[<9///] 4 959 = [0///] 4 959(l + 11-2^1) (8) 

[<9///] 50 o7 = [0///W1 + N) (9) 



We use the same random number ro to spread Ha and Wf3 in or- 
der to preserve the ratio of these lines. 

The spec tral wavelength ran ge of VVDS spans from 0.55 to 
0.94yum dLe Fevre et al.l2 005) making the [Oil] line visible from 
redshift 0.5 to 1.5 and the Ha line visible from redshift to 0.5. 
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Thus we choose to evaluate the validity of the Ha fluxes using 
the ratio Ha /Hp, known as the Balmer decrement and having a 
value ~ 2.9. The spread of the CMC emission line ratios fits well 
the emission line spread observed in the VVDS data. 

scaled lines ratio histogram 
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Fig. 10. Emission line ratios for the CMC catalog in black solid 
line compared to VVDS ratios in magenta dashed line. The cyan 
dotted line is the value of the theoretical emission line ratios. 



For a further validation of the CMC emission lines we 
choose to reproduce the spectroscopic success rate (SSR) of the 
VVDS survey. To realise the CMC(VVDS) SSR we have extrap- 
olated the flux sensitivity of the VVDS survey as a function of 
wavelength for several signal-to-noise ratio based on the [Oil] 
emission lines. Using these sensitivities and the CMC emission 
line catalog we derive the simulated SSR for the VVDS survey 
in order to validate the emission line model of the CMC catalog 
(Figures QT| and Q~2]i . Figures HT1 show the SSR for the emission 
lines of the CMC catalog compared to those of the VVDS as 
a function of redshift and magnitude. The top panel shows the 
SSR of emission lines as a function of redshift, dashed lines for 
the VVDS survey and solid lines for the CMC. The blue lines 
are for the Oil lines, red for Ha, cyan for H/3, gold for OHIa at 
4959A and purple for OIII at 5007 A using a growing thickness 
for each one in that order. The bottom panel shows the same as 
the top panel as a function of magnitude. The strong emission 
lines, Ha and Oil, are in very good agreement with the VVDS 
both in magnitude and redshift space. However weaker emission 
lines have a discrepancy of 10 to 20 % compared to the VVDS 
emission lines at low magnitudes, especially for the [OIII] lines 
at 4959 A and 5007 A, but it agrees very well at magitudes / > 2 1 . 
Figures [T2] compare the overall SSR predictions of the CMC 
with the VVDS secure and very secure redshifts. The top panel 
shows the VVDS SSR for secure redshift (brown thin lines) and 
very secure redshift (orange thick line) as a function of redshift 
compared to the CMC-VVDS emission lines 3cr flux detection 
(grey thin lines) and 2 lines at 5cr detection or Oil at 5<x detection 
with V < 24 (magenta thick lines). The bottom panel shows the 
same as the bottom panel as a function of magnitude. We see that 
the VVDS secure redshifts success rate is very close to the 3<x 
detection line of the CMC catalog. In the same way, the VVDS 
very secure redshift success rate agrees very well with the 2 lines 
detection at 5cr. This includes galaxies with V < 24, which cor- 
responds to a detection of the continuum in the VVDS spec- 



100 



c 



OS 

m 
m 



40 



■ \ ~ ' ^ ' v 

80 I— »' 1 *\ rt VI ' - 1 

_ if ^ •> 

en L '. "/' »v - " » 
< llj V v II 



i air - . 



hi 

20 h" 

1 III, 
.< 

.1—1 



t 

•v 

»'\ II 
n T-i 
n ii ' 
IV; 
l V 



■ 

i .ISM 



0.5 



100 



1 

redshift 



S/N(0II)>3 
S/N(0IIIa)>3- 
S/N(0IIIb)>3 
S/N(Ha)>3 - 
S/N(Hb)>3 

VVDS ' 

CMC 



_1_ 

1.5 



i 1 1 r 




32 

I magnitude 



Fig. 11. Comparison of the lines Spectroscopic Success Rate 
(SSR) of the VVDS survey with the simulated lines VVDS SSR 
using the CMC emission lines. 
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tra with a 5/JV > 10 in the blue part of the spectrum (which is 
free of strong sky emission lines). The differences mainly comes 
from the redshifts effectively obtained using absorption lines for 
which the shape of the continuum altogether does not match ex- 
actly with our ad-hoc V-band magnitude criterion. Although the 
CMC does not simulate the absorption lines, we can say that 
the CMC emission line shows a good agreement both in redshift 
and magnitude with the VVDS spectroscopic survey. These re- 
sults makes us confident in using the CMC catalog as a tool to 
predict the SSR of future wide field spectroscopic surveys. 

4. Discussion 

Different cosmological tests have been proposed to probe the ge- 
ometry and growth of structures in order to shed new light on the 
nature of dark energy. The best approach is certainly to combine 
different probes to reduce the errors and better understand the 
systematics. However, each probe has its own requirements and 
it is a technical and scientific challenge to design a telescope op- 
timised for more than one cosmological probe. Nonetheless, the 
goal of future dark energy programs should find a survey strate- 
gies that will lead to the best combined efficiency of different 
probes. 

Having a realistic description of galaxy properties in our 
Universe is key to properly forecast what future deep and wide 
surveys can achieve for a given survey configuration. In this sec- 
tion, we start a discussion with requirements for a dark energy 
survey that combines shape measurements and a photo-z cali- 
bration survey (PZCS) for weak lensing with a BAO survey. 

Thus, we will discuss two important aspects of these future 
dark energy mission: (i) the impact of galaxy size in terms of 
shape measurement for weak lensing depending on the PSF size 
( ii) the different typical WL and BAO survey configurations we 
have tested using our mock catalogues and ( Hi) their expected 
spectroscopic success rate. 

Note that we are not conducting here any optimisation of 
such a space mission and its survey strategy. We are just putting 
in place some of tools necessary for such an optimisation. 



4.1. Galaxy-sizes 

A comparison of the simulated galaxy-size distribution to ob- 
servational data is not particularly useful. Indeed the simulated 
catalogs agree, by construction, with the size distribution mea- 
sured i n the COSMOS ACS i maging survey dKoekemoer et al.l 
I2007I) . dLeauthaud et al. 2007). No other survey capable of re- 
solving faint galaxies approaches the size of the COSMOS/ACS 
data. In Figure [13] we show the cumulated half-light radius dis- 
tribution for the COSMOS data by bins of magnitude (different 
colors and thickness). 

The brown area corresponds to the typical PSF size of 
ground-based telescopes. It shows that we can only resolve and 
measure galaxy shapes (those galaxies with a size larger than 
the PSF size) for about 15gal/arcmin 2 (or up to 20gal/arcmin 2 in 
case of excellent seeing) as soon as a depth of / ~ 25 is reached. 
Going deeper than I = 25 will not help to raise this number den- 
sity as fainter galaxies are smaller than the PSF, thus making a 
shape measurement extremely difficult. 

The grey area corresponds to the PSF size expected with fu- 
ture space Dark Energy mission; the exact value of the PSF size 
depends on the telescope diameter and the chosen pixel scale 
(here we assume a telescope diameter of ~ 1.5-1.8 meter). 
Interestingly, with this small PSF size, the number density of re- 
solved galaxies does increase as a function of the depth of the 
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Fig. 13. Comparison of a ground-based and space weak-lensing 
survey using the COSMOS cumulated half-light radius distibu- 
tion by bins of magnitude. 



survey. Beyond / = 26 the increase in number density is how- 
ever smaller, suggesting that there is a limited gain to go much 
deeper than / = 26 except perhaps for telescope designs with the 
smallest PSF size. With a depth of / ~ 25.5-26 the number den- 
sity of resolved galaxies in the foreseen space surveys is about 
3x larger than what ground-survey can achieve. 

Contrary to the ground case, the performance of a space sur- 
vey in terms of galaxy number density critically depends both 
the depth and the resolution of the images. The black dashed ver- 
tical line correspond to the PSF siz e of the ACS/HST cam era as 
measured in the COSMOS survey iLeauthaud et all d2007t) . The 
COSMOS survey reach a number density of resolved galaxies 
of ~ 1 lOgal/arcmin 2 down to / ~ 26.5. It is important to rec- 
ognize however that not all these galaxies could be used in the 
COSMOS 3D weak lensing analysis, as explained in Leauthaud 
et al (2007). For a fraction of these galaxies there was no coun- 
terpart in the ground-based catalogue because of masking of the 
data. For the remaining galaxies, it was not always possible to 
determine securely the shape or the redshift thus decreasing fur- 
ther the number of usable galaxies to roughly 40gal/arcmin 2 . 

Cosmological weak lensing survey aim to maximize the 
number of resolved galax ies that can be used for we ak lensing 
tomography. As shown by Amara &Refregiei(l2007|), it is more 
efficient to conduct a wider survey than a deeper survey as the 
galaxy number increase is larger for a given exposure time by go- 
ing wide than by going deep. However, the increase of Galactic 
absorption and stellar density, when getting closer to the Milky 
Way plane, will limit the gain of going wider than deeper when 
surveying areas larger than -15 000 sq.deg. 

4.2. BAO and WL spectroscopic survey requirements 

We investigate here three types of spectroscopic redshift 
surveys, as can be seen in Figure [14] and Table [4), whose 
characteristics correspond to typical requirements for the BAO 
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and WL probes. 

(i) BAO aims to measure the baryon accoustic oscillation 
peak of the 2-pt correlation function as a measure of a stan- 
dard ruler. It requires a large survey ar e a to reduce the sta- 
tistica l noise (iGlazebrook & Blakd 120051; iBlake & Glazebrookl 
2003) and an accurate redshift for each galaxy, typically using 
spectroscopic techniques to reach a higher accuracy. One can 
use photometric redshifts instead of spectroscopic redshifts, but 
the loss of line-of-sight information then requires a few times 
larger area to reach the same dark energy figure of merit (FOM) 
Glazebrook & Blake (2005). Following this last paper, we in- 
vestigate a WIDE near infrared (1.0 to 1.7 micron) spectro- 
scopic survey reaching a 3<x sensitivity of lxl0~ l6 ergs cm~ 2 s~ x 
at 1 .2//m (this can be achieved with an efficient slitless spectro- 
graph using a 1.5m telescope, 0.28"/pixel, R~500 and an expo- 
sure time of 1200 sec) and covering a large area on the sky to 
maximise the dark energy FOM. The flux sensitivities of such 
a survey are represented by the solid cyan (and high thickness) 
line in Figure H4l 

(ii) The weak-lensing tomographic analysis is likely the 
most effic ient lensing method to estimate the dark ener gy pa- 
rameters dMassev et alj|2007t lAmara & Refregierl 12007). This 
method probes the growth of structures but needs to have ac- 
curate redshifts to place the background sources at their correct 
location. To achieve this measurement, it is essential to deter- 
mine with the best accuracy (and minimal biases) the photo- 
metric redshift (photo-z) of all the galaxies to b e used in the 
weak lensing tomography. iMa & B ernstein ( 200jl) have shown 
that a high accuracy photo-z is needed to avoid any bias on 
the Dark Energy parameter estimation. The only way to reach 
high accuracy photo-z is to plan a spectro scopic redshift sur - 
vey to calibrate the photo-z templates, as in lllbert et all d2006). 
Ideally, the photo-z calibration survey (PZCS) would reach the 
same magnitude and redshift depth as the WL photometric sur- 
vey. However, this goal will likely be difficult to achieve and 
the CMC emission-line catalog may help to plan the best spec- 
troscopic strategy. Thus, we design a DEEP-visible-NIR survey 
(0.6 to 1.7 microns) for weak-lensing, reaching a 3cr flux sen- 
sitivity of 5.\0~ li ergs cmr 2 s~ x at 1.2yum (this can be achieved 
with an efficient slitless spectrograph using a 1.5m telescope, 
0.28"/pixel, R~250 and an exposure time of 240 ksec~67 hours) 
as an attempt at calibrating photometric redshifts for a wide 
range in redshift and magnitude. The flux sensitivities of such a 
survey are represented by the solid red (and medium thickness) 
line for a 5<x detection and by the solid black (and thin) line for 
a 3cr detection in Figure H4l 

(Hi) We also desgin a DEEP-NIR survey having the same 
characteritics as the DEEP-visible-NIR survey but covering only 
the NIR part (1.0 to 1.7 micron; focussing on the low IR back- 
ground of space observation). Our goal here is to evaluate the 
importance of having a spectroscopic contribution in the visible 
wavelength to reach a high completness. 

In Figure H4l we also represent the VVDS-DEEP flux sen- 
sitivities in green (and thin) dashed line for a 3cr detection and 
gold (and thick) dashed line for a 5cr detection. For reference, the 
exposure time of the VVDS-DEEP survey is ~10ksec exposure 
on a 8m ground-based telescope with a spectroscopic resolution 
ofR=250. 



4.3. SSR prediction 

In the following sections we will give the basics of photo-z cal- 
ibrations survey studies based on the spectroscopic success rate 



as a function of redshift and magnitude. A more thorough anal- 
ysis will be developed in a forthcoming paper (Jouvel et al 2009 
in prep). 




wavelength in microns 

Fig. 14. Flux sensitivities for the VVDS-DEEP survey as a func- 
tion of wavelength compared to forecast of future space WIDE 
and DEEP NIR [purple and shaded area] surveys with or without 
visible coverage. 
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4.3.1 . SSR for the WIDE survey 

FiguresHH shows SSR forecasts for a future NIR WIDE survey. 
Top panel represents the SSR as a function of redshift and bot- 
tom panel as a function of magnitude. Both panels show the SSR 
using a one line at 5<x detection criterion in orange very thick, a 
one line at 3<x detection criterion in grey thick , and a continuum 
S/N>10 criterion in purple thin. The solid curves are a percent- 
age with the total number of objects and the dashed for a cumu- 
lative number of objects by arcmin 2 . The top panel shows also 
the SSR of a 3<x detection of the Oil emission line in blue long- 
dashed thin line and Ha in red long-dashed thick line. The wide 
survey would aim to cover a large fraction of the sky (10,000 
deg 2 or more) in order to probe the large scale distribution of 
galaxies aiming in particular to measure the baryon acoustic os- 
cillation with a great accuracy in the redshift range 0.5 < z < 1 .5 
(or up to z ~ 2 providing the telescope and instrument is sen- 
sitive up to 2/im). Our SSR prediction, shows that such survey 
could easily measure the redshift of more than 4 galaxy/armin 2 
hence providing the redshift measurement of more than ~100 
million galaxies for a survey covering 10,000 deg 2 . The redshift 
identification is essentially based on the Ho- line detection thus 
essentially targeting star-forming galaxies with H < 22.5. The 
SSR clearly shows that such survey is far from being complete 
as the spectroscopic success rate is generally below 40% for any 
magnitude and redshift (for a 5<x line detection). Indeed, there 
are very few faint galaxies with spectroscopic success, especially 
in the redshift ranges which have the strongest need for photo- 
metric redshift calibration : < z < 0.5 and z > 1.5. Table [4] 
shows that the SSR also depends strongly on galaxy type. Thus, 
the wide survey can not be used to calibrate any photometric 
redshift catalogue properly. 



4.3.2. SSR for the DEEP survey 

Figures [16] and [l7]show the SSR prediction for future dark en- 
ergy surveys which are planing to do infrared only (DEEP-NIR 
survey), and visible +infrared spectroscopy (DEEP-visible-NIR 
survey) from space. Figure [14] shows the grism flux sensitivities 
we use in our analysis. It shows SSR as a function of redshift 
[top panel] and magnitude [bottom panel]. Both panels show the 
SSR using a 2 lines detection with at least one line at 5cr de- 
tection criterion in orange very thick, a one line at 3<x detection 
criterion in grey thick, and a continuum S/N> 10 criterion in pur- 
ple thin. The solid curves are a percentage with the total number 
of objects and the dashed for a cumulative number of objects by 
arcmin 2 . The top panel shows also the SSR of a 3<x detection of 
the Oil emission line in blue long-dashed thin line and Ha in red 
long-dashed thick line. 

We assume here that the visible spectroscopy spans 0.6-1^/m 
see Figure [14] Because of the limited wavelength coverage, the 
Ha is the only strong emission line visible for galaxies at low 
redshift (z<0.5). This explains that the detection criterion based 
on 2 lines, does not cover the low redshift range as shown in the 
top panel of Figure [16] This is also seen in the bottom panel 
of Figure [16] where there is a lower SSR for bright galaxies. 
However, for these low redshift (z < 0.5) galaxies, about 30% 
have a strong continuum with S/N> 10. For these bright galaxies, 
we should be able to accurately measure a redshift using either 
the Ha line detection, absorption lines and the shape of the con- 
tinuum which will be useful for photometric redshift calibration. 

If only the NIR spectroscopy is conducted from space, the 
survey will only reach ~ 35galaxy per arcmin 2 and a maximal 
SSR of at best 40% for / ~ 24. Using DEEP-NIR spectroscopic 
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survey would help to calibrate spectra at 0.8 < z < 2.5 but would 
be less efficient below z < 0.8. There will be a strong need to 
conduct visible spectroscopy to achieve a complete census of 
the redshift distribution. 

Using the DEEP-visible-NIR survey, we should be able to 
measure a very secure redshift (2 lines detected at more than 
3cr with at least 1 emission line at 5cr) for about 60 galaxy per 
arcmin 2 to / < 26 and z < 2.5 with a SSR reaching 80% at best 
for / ~ 24. With this density of sources, a survey of 1 square 
degree will provide 220000 spectra. Note, however, that these 
numbers do not take into account the crowding of galaxy spec- 
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tra which is likely a strong limiting factor above / ~ 24. There 
are ways to minimize the impact of crowding, for example by 
conducting the slit-less spectroscopic survey using different ori- 
entations, or by using masks to block some of the light (thus 
limiting the sky background and reducing the number of over- 
lapping spectra). These alternatives will be discussed in a forth- 
coming paper (Zoubian et al. 2009 in prep. ). Note also that the 
visible part of the spectroscopy might be conducted with a ded- 
icated ground-based spectrograph with high multiplexing. This 
can probably be especially more efficient for A < 0.75/im where 
the sky emission lines are less numerous than in the redder part 
of the spectrum. 



Table 4. Characteristics of the three surveys discussed in the 
text, assuming an efficient 1.5m space telescope with an obscu- 
ration of 0.6m and a total telescope throughput (CCD, mirror, 
grism) of 70% using a survey efficiency of 75%. The %[lline3cr] 
Ell represent the percentage of Elliptical galaxies spectroscopi- 
cally found using a success criterion of 1 line detected at 3<x 
compared to the total population of Elliptical galaxies. With the 
same criterion, we define the percentage of Sac for early spiral 
galaxies, Sdm for late spiral galaxies, and SB for starburst galax- 
ies. 



probe 

fobs 

A in /im 
3er at 1.2/zm 

area needed 
mission time 

nbr density (5<x) 

z[lline3<r] 
m ; , m [lline3<r] 
%[lline3o-] Ell 
%[lline3cr] Sac 
%[lline3cr] Sdm 
%[lline3cr] SB 



DEEP-vis-NIR 
WL 
240ksec 
0.6-1.7 
5.KT 18 

lOrfeg 2 
0.2yrs 

60 

< z < 3.5 
I~ 27 
40 
60 
77 
95 



DEEP-NIR 
WL 

240ksec 
1-1.7 

5.10- 18 
ergs crrr 2 sec~ l 

\0deg 2 

0.2yrs 
FOV of Q.Sdeg 2 
35 

gal I arcmirr 
0.5 < z < 3.5 
I~ 26.5 

27 

51 

63 

74 



WIDE-NIR 
BAO 
1200sec 
1-1.7 
1.10 16 

20000deg 2 
2yrs 



0.5 <z< 1.5 
H~ 22 

1 

10 

45 



5. Conclusion 

We have produced simulated deep galaxy catalogs by two dif- 
ferent techniques: one starts with redshift-dependent Schechter 
luminosity functions for three galaxy types for < z < 6, ex- 
trapol ating the LFs derived from GOODS data by Dahle n et al.l 
(2005) (the GLFC). The other has galaxies with redshifts and 
SEDs taken as the best photo-z fits to multiwavelength obser- 
vations of the COSMOS field (the CMC). Both adopt galaxy 
size distributions from the COSMOS HST imaging and we as- 
sign emission line strengths using a recipe in agreement with the 
VVDS-DEEP data. 

Both simulated catalogs do an excellent job of reproduc- 
ing the dN/dm data (< 20% discrepancies) and color distribu- 
tions (0.1-0.2 mag median color discrepancies) of galaxies ob- 
served in bands from 0.4 to 2.2 jum using a comparison with 
the GOODS and UDF surveys. The CMC provides an excellent 
match to the redshift-magnitude and redshift-color distributions 
for / < 24 galaxies in the VVDS spectroscopic redshift survey. 



These simulated catalogs thus pass all our validation tests 
for use in forecasting the galaxy "yields" of future visible/NIR 
imaging surveys, as long as we restrict our analysis to the / < 
25.5-26 galaxies for which the COSMOS HST imaging is highly 
complete. These simulated catalogs provide a conservative esti- 
mate of the yield for surveys that go deeper in the visible, or 
reach > 24 in the NIR, because they miss the faint or red galax- 
ies that do not make the /-band cut. 

The GLFC has the potential to forecast deeper surveys than 
the CMC, but only if we are ready to extrapolate the luminosity 
functions to fainter magnitudes, which is probably feasible in 
the visible bands, but may be more strongly limited in the near- 
infrared bands. 

In the CMC, the galaxy sizes are directly coming from the 
HST/ACS measurement and give us the opportunity to evalu- 
ate differences in terms of possible shape measurements for a 
ground-based and space telescopes. A ground based WL survey 
is limited to use only the resolved galaxies, and we show that 
most I > 25 galaxies are smaller than the ground-based seeing 
disk. Thus the WL depth of ground-based surveys is basically 
independent of the survey depth for I > 25. For a space survey 
the number of resolved galaxies is not only dependent on the 
PSF but also on the depth of the survey. 

Moreover we have also used the CMC and the emission line 
= model to compare the spectroscopic success rate observed by 
the VVDS and the one we can predict based on the CMC line 
fluxes. In general we obtain a good agreement between observed 
success rates and the CMC predictions. 

We have then used the emission line model to explore what 
will be the yield in terms of redshift measurements for 2 type 
of surveys: (i) a WIDE survey motivated by measuring BAO, 
and (ii) a deep survey motivated to calibrate photometric red- 
shifts. We found that the WIDE survey reaches a density of 
4gal/arcmin 2 for galaxies H < 22.5 in a redshift range of 
0.5 < z < 1.5 mainly detected using the Ha emission line. 
Furthermore, in a weak-lensing perspective, a DEEP survey both 
visible and NIR is needed to conduct a photometric redshift cal- 
ibration of galaxies (PZCS) to be used in a weak-lensing mea- 
surement covering the full magnitude and redshift range. Indeed 
such survey reaches a density of 60 gal/arcmin 2 for very secure 
redshifts down to / ~ 25.5 with a high completeness of ~80%. 
We note that the visible part of the spectroscopic survey can be 
done from the ground, however above A ~ 0.75^/m a space sur- 
vey is likely to be much more efficient than a ground-based sur- 
vey (see Figure [T4l. 

We have thus demonstrated here how useful realistic simu- 
lated catalogs are for designing future DE space missions. We 
will investigate more precisely details of the WL optimisation 
survey strategy in a forthcoming paper (Jouvel et al. 2009, in 
prep. ). 
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ABSTRACT 

Context. Future dark energy space missions such as JDEM and EUCLID are being designed to survey the galaxy population to 
trace the geometry of the universe and the growth of structure, which both depend on the cosmological model. To reach the goal of 
high precision cosmology they need to evaluate the capabilities of different instrument designs based on realistic mock catalog of the 
galaxy distribution. 

Aims. The aim of this paper is to construct realistic and flexible mock catalogs based on our knowledge of galaxy population from 
current deep surveys. We explore two categories of mock catalog : (i) based on luminosity functions fit of observations (GOODS, 
UDF,COSMOS,VVDS) (ii) based on the observed COSMOS galaxy distribution. 

Methods. The COSMOS mock catalog benefits from all the properties of the data-rich COSMOS survey and the high accuracy 
photometric redshift distribution based on 30 bands photometry. Nevertheless this catalog is limited to the depth of the COSMOS 
survey. Thus, we also evaluate a mock galaxy catalog generated from luminosity functions using the Le Phare software. For these two 
catalogs, we have produced simulated number counts in several bands, color diagrams and redshift distribution for validation against 
real observational data. 

Results. Using these mock catalogs we derive some basic requirements to help designing future Dark Energy mission in terms of 
number of galaxies available for the weak-lensing analysis as a function of the PSF size and depth of the survey. We also compute 
the spectroscopic success rate for future spectroscopic redshift surveys (i) aiming at measuring BAO in the case of the wide field 
spectroscopic redshift survey, and (ii) for the photometric redshift calibration survey which is required to achieve weak lensing 
tomography with great accuracy. In particular, we demonstrate that for the photometric redshift calibration, using only NIR (l-1.7yum) 
spectroscopy we can not achieve a complete spectroscopic survey down to the limit of the photometric survey (/ < 25.5). Extending 
the wavelength coverage of the spectroscopic survey to cover 0.6-1.7/Ym will then improve the fraction of very secure spectroscopic 
redshift to nearly 80% of the galaxies making then possible a very accurate photometric redshift calibration. 

Conclusions. We have produced two realistic mock galaxy catalogs that can be used in studying the best survey strategy of future 
dark-energy missions in terms of photometric redshift accuracy and spectroscopic redshift surveys yield. These catalogues are publicly 
accessible at http://lamwws.oamp.fr/cosmowiki/RealisticSpectroPhotCat, or by request to the first author of this paper. 

Key words, redshift - JDEM - cosmology - surveys - mock catalog 



1. Introduction 

The prospect for high-precision cosmological inferences from 
large galaxy surveys has prompted the initiation of several 
projects with the goal of surveying thousands of square de- 
grees of sky in multiple filters and/or spectroscopy. The ground 
based projects [e.g. current KIDS, DES, and future Pan- 
STARRS, LSST in imaging and current SDSS-III/BOSS and fu- 
ture WFMOS BigBOSS in spectroscopy], and the space based 
missions [JDEM, and EUCLID, or their former and future con- 
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cepts] all propose to conduct wide field galaxy survey (in imag- 
ing and/or spectroscopy) in order to exploit the power of weak 
gravitational lensing [WL] and galaxy distribution (using bary- 
onic acoustic oscillation [BAO] and/or the redshift distortion 
measurement [RD]) to elucidate the cause of the acceleration 
of the Hubble expansion (Riess et al. (1998), Perlmutter et al. 
(1999), Astier et al. (2006), Kilbinger et al. (2009)). Proper de- 
sign and forecasting of the performance of these experiments 
requires an accurate estimate of the "yield" of galaxies, such as 
number counts, redshift size and color distribution, from a cho- 
sen survey configuration. It is typically straightforward to esti- 
mate the expected resolution and noise properties of the tele- 
scope and instrument, but more difficult to quantify the number 



and properties of the "useful" galaxies available on the sky. A 
good forecast requires that we estimate the density of galaxies 
on the sky over the joint distribution of: 

(i) redshift; (ii) angular size, expressed as half-light radius rip.; 
(iii) apparent magnitudes and colors in any chosen instrument 
passbands; (iv) emission-line strengths. 

The first three properties are essential to knowing whether a 
given galaxy will be detected at sufficient signal-to-noise (S IN) 
and resolution to determine its shape and its photometric red- 
shift. The last property, the emission-line strength, is essential 
to estimating the depth and completeness that any spectroscopic 
redshift survey will achieve. Spectroscopic redshift accuracy is 
needed to conduct the best possible BAO/RD measurements, and 
to calibrate photometric redshift ("photo-z") estimators that are 
essential for WL tomographic measurement (e.g. Massey et al. 
(2007)). 

In this paper we present two simulated catalogs of galaxy 
properties based on current deep surveys, that we will use in 
forthcoming papers (Jouvel et al. 2009 in prep.) to forecast 
the performance of WL Dark Energy space based mission. 
Unfortunately we cannot simply use a catalog from some com- 
pleted survey, since (1) no single survey of useful depth has si- 
multaneously observed all of the listed properties for its target 
galaxies, and (2) the proposed surveys will exceed the depth, 
field of view, resolution, and/or wavelength coverage of most 
existing observed galaxy catalogs. For example the space based 
spectroscopic survey will likely mainly be conducted in the near- 
infrared, surpassing any current infrared spectroscopic redshift 
survey. The mission concepts of SNAP, Destiny, and EUCLID 
propose near-infrared (NIR) imaging over very wide areas 
but only a few square arcminutes of HST/NICMOS imaging 
data with the UDF (Coe et al. 2006) and close to one degree 
down to K ~ 23 for ground based data with the UKIDSS sur- 
vey on the UKIRT telescope (Lawrence et al. 2007) are avail- 
able to date at these magnitude levels, insufficient to serve as 
a robust source model. Any relevant simulated catalogs must 
therefore include some degree of extrapolation or modelling of 
the source population. 

In this paper, we explain how we have constructed, based on 
deep survey data, simulated galaxy catalog. Section 2 explain 
the methodology used. In section 3 we validate these simulated 
catalogs by comparing their predicted magnitude, color, redshift, 
size, and emission-line strength distributions to real survey data 
taking into account the survey selection functions. 

Although this work was initiated in the context of the SNAP 
collaboration (see our first results in Dahlen et al. (2008)), it can 
easily be adapted to any instrument concept for a proper evalua- 
tion of its merit. When necessary we assume a KCDM universe: 
(Aw,£_ A ) = (0.3,0.7) and H = 70 km/s/Mpc. All magnitudes 
used in this paper are on the AB system. 

2. Realistic mock Galaxy Catalog 

For both of our simulated catalogs, each galaxy is assigned a 
spectral energy distribution (SED), which can be integrated over 
any instrumental passband to forecast an apparent magnitude. 
Our first fully simulated catalog has been generated by using 
"Le Phare" simulation tool 1 (Arnouts and Ilbert 2009 in prep) 
with an analythic luminosity function based on the GOODS sur- 
vey from the work of Dahlen et al. (2005). We will refer to our 
derived simulated catalog as the GOODS Luminosity Function 
Catalog (GLFC). 

1 Arnouts http://www.oamp.fr/people/arnouts/LE_PHARE.html 
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Fig. 1. Extended CWW library of SED templates. This represents the 
full range of SED template lineraly interpolated between the 4 CWW 
templates and one star-forming template in AB magnitude (arbitrarily 
flux scaled at 4000 A). 

2.1. GLFC - GOODS Luminosity Function based Catalog 

2.1.1. SED Library 

The GLFC relies on two ingredients : 

- A sub-class of SED spanning the entire range from Elliptical 
to starbusting galaxy, 

- A redshift evolving luminosity function (LF) per type. 

Those ingredients are the basic requirements to generate a cata- 
log that can predict global distributions such as magnitude, red- 
shift and colors counts. We use the Coleman Extended library 
(CE) for our templates of galaxies. This list is based on the four 
observed spectra of Coleman et al. (1980) corresponding to an 
Elliptical, Sbc, Scd and Irregular, which have been extrapolated 
in the UV and IR wavelengths domain by using synthetic spectra 
from the GISSEL library (Chariot and Bruzual,1996). To repro- 
duce observed colors bluer than the CE templates, we add one 
spectrum of star-forming galaxies computed with the GISSEL 
model for solar metallicity, Salpeter IMF, constant star forma- 
tion rate and 0.05 Gyr age. Following the approach adopted by 
Sawicki et al. (1996), we have linearly interpolated the original 
5 SEDs to provide a finer grid of spectral-type coverage with a 
total library of 66 templates shown in Figure 1 . 

2.1.2. Extinction 

We have diversified our templates in adding extinction coming 
from the Calzetti extinction law (Calzetti et al. 2000) with the 
excess redenning values E(B - V) of 0,0.1,0.2,0.3 mag using the 
extinction formula: 

fl fl m in-l).4»i(i).£(B-V) /i\ 

J lUX a ttenuated\ A ) — J L ux intrinsic\ /i )-l<J t-U 

The k(A) values follow the Calzetti law. Each E(B - V) value 
will be used to create a template adding diversity to the tem- 
plates already existing. This has been applied to the irregular 
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Fig. 2. Four of the CWW templates: Ell (red dotted), S hc (gold 
dashed), S c j (green long-dashed), Irr (cyan dot-dashed) and the star- 
forming (blue long dot-dashed) shown at some given redshift of 
0.5,1,1.5,2,2.5,3. The templates are M* galaxies as given in Table 2. 
This is useful to show the color of these templates which may help in 
deciding which sensitivities are needed to detect galaxies in different 
passbands. 

Table 1. Restframe colors of SED templates. 



LF 


E(B-V) 


types 


B — V(rest frame) 


1 





1 -» 17 


1.069 -> 0.701 


2 





17 ^ 55 


0.701 -> 0.2167 


3 





55 ^66 


0.2167 -> -0.0301 


3 


0.1 


56^66 


0.3252 -> 0.1016 


3 


0.2 


56->66 


0.4592 -> 0.2345 


3 


0.3 


56->66 


0.5945 -» 0.3686 


Main templates 


E(B-V) 


nbr 


B — V(rest frame) 


Ell 





1 


1.069 


S be 





21 


0.6262 


S cd 





36 


0.5077 


Irr 





51 


0.3185 


star — forming 





66 


-0.0301 



and star-forming templates increasing the number of templates 
for those types. Finally we also include the Lyman absorption by 
Intergalactic Medium as a function of redshift following Madau 
(1995). 



We have extrapolated these LFs to z — 6 to be complete for the 
galaxies likely to be used in future surveys. The Dahlen z = 1 LF 
for each type is held nearly fixed for 1 < z < 6, with some ad- 
justments made to improve the match to data from the COSMOS 
survey (described below). 

Table 2 gives the LF calculated for 3 galaxy types extrap- 
olated until redshift z — 6 as explained above. To produce the 
mock catalog, knowing the luminosity function, Le Phare de- 
rives a number of object by magnitude and redshift bin (z, m) 
using a Schechter function (Schechter 1976) : 



n(M(z, m))dM ■ 



M(z, m) 
M* 



exp 



M(z, m) \ dM 
M* I ~M* 



(2) 



M is the absolute magnitude which is a function of redshift and 
apparent magnitude (z, m), M*,<p* and a are the parameters of 
the Schechter function given in the table 2. 

According to the adopted area, the simulated galaxy counts 
in each magnitude-redshift-type bin are drawn from a Poisson 
distribution of the expected number based on the LF. 

Note that the luminosity function parameters depend on the 
cosmological model assumed (as the luminosity function is ex- 
pressed in physical units), however the resulting galaxy catalog 
can be considered independent of the input cosmological param- 
eters assumed here, as it is built to reproduce the observed galaxy 
distribution and properties. 



Table 2. Luminosity function in the B band extrapolated to match 
COSMOS colors and number counts. Those values area the one sed 
in GLFC. 



LF 


z 


f 


M* 


a 




0.5 


16.7e-4 


-21.15 


-0.71 




0.75 


18e-4 


-21.13 


-0.50 




1.0 


9.5e-4 


-21.72 


-0.98 




2.0 


8.5e-4 


-21.72 


-0.98 




3.0 


7.5e-4 


-21.72 


-0.98 




6.0 


6.5e-4 


-21.72 


-0.98 


2 


0.5 


24.0e-4 


-21.08 


-1.37 


2 


0.75 


25.0e-4 


-21.29 


-1.17 


2 


1.0 


25.0e-4 


-21.2 


-1.1 


2 


1.5 


24.4e-4 


-21.15 


-0.05 


2 


2.0 


24.4e-4 


-21.15 


-0.99 


2 


3.0 


24.4e-4 


-21.15 


-0.99 


2 


6.0 


24.4e-4 


-21.15 


-0.85 


3 


0.5 


48.6e-4 


-18.84 


-1.1 


3 


0.75 


33.1e-4 


-19.67 


-1.18 


3 


1.0 


33.1e-4 


-20.55 


-1.52 


3 


1.5 


30.0e-4 


-20.62 


-1.62 


3 


2.0 


25.0e-4 


-20.72 


-1.7 


3 


3.0 


25.0e-4 


-20.72 


-1.8 


3 


6.0 


25.0e-4 


-20.72 


-1.8 



2.1.3. GLFC Luminosity Function 

The GLFC is generated by assuming a luminosity function (LF) 
for galaxies of 3 different types, then drawing galaxies from 
these LFs to populate our simulated sky area. We have selected 
these 3 types following the B-V restframe colors given in Dahlen 
et al. (2005) (see table 1). We start with the LF estimated in rest- 
frame B band by Dahlen et al. (2005) in the 3 spectral types in- 
side 4 redshifts bins over < z < 1 . Dahlen et al. (2005) derive 
these LFs by fitting to photo-z data from the GOODS survey. 



Drawing from the luminosity functions gives the galaxy dis- 
tribution over the joint magnitude-color-redshift space. We as- 
sign a half-light radius to each galaxy as follows: we first calcu- 
late the galaxy's apparent magnitude in the F814W HST filter. 
Leauthaud et al. (2007) provides a size-magnitude catalog for 
the 1 .64 deg 2 COSMOS survey executed in this filter. We assign 
the simulated galaxy a half-light radius that is drawn at random 
from all galaxies of the same F814W apparent magnitude in the 
observed COSMOS catalog. Thus the size-magnitude distribu- 
tion of the simulated catalog will, by construction, exactly match 



the COSMOS observations. The procedure does not reproduce 
any additional dependence of galaxy size upon type, color, or 
redshift using the COSMOS catalog. If needed, this can be im- 
plemented using a match between the simulated redshift and the 
COSMOS photometric redshift distribution. 

2.1.4. Photometric Noise 

A real survey has noise from the finite photon count and detector 
noise. Estimation of this noise is of course essential for forecast- 
ing survey performance. It may also be important for validation 
of the simulation against real data, since the noise can induce bi- 
ases on number counts. Le Phare produces both noiseless magni- 
tude and noisy magnitude for each chosen observation bandpass. 
The noisy magnitudes are randomly drawn inside a gaussian dis- 
tribution centered around (m, err m ). We define a reference couple 
(m\ err*) calculated from the survey caracteristics and the error 
err m at any magnitude m. At magnitude m < m*, the object 
noise is dominating and we assume that the magnitude error 
follows a power law of the adjustable form : 

err m = lQ(0Mp + D(m-m')) (3) 

At magnitude m > m*, the sky noise is dominating and we 
assume the error on magnitude follows an exponential law : 

err m = err* /2.72.exp(10 (q(m - m " >) ) (4) 

(p, q) are the slopes for the power laws, both derived from the 
survey caracteristic. 

We will, however, use noiseless magnitudes for the valida- 
tion tests in this paper, because the comparison surveys {e.g. the 
UDF) have high S IN in the regimes of comparison, and because 
the noise-induced biases are generally small. 

2.2. CMC - The COSMOS Mock Catalog 

The second simulated galaxy catalog is built directly from the 
observed COSMOS catalog of Capak et al. (2009) in prep, 
and Ilbert et al. (2009). We will refer to this simulation as the 
COSMOS mock Catalog (CMC). 

2.2.1 . The COSMOS catalog 

The COSMOS photometric -redshift catalog Ilbert et al. (2009) 
was computed with 30 bands over ~2-deg 2 taken from Galex for 
UV bands, Subaru for the optical (U to z) and CFHT, UKIRT 
and Spitzer for the NIR bands. However, we restrict our mock 
catalog to the central square area of 1.38 deg 2 fully covered by 
HST/ACS images. The COSMOS-ACS catalog gives 592000 
galaxies for an area of 1.38deg 2 . This is roughly a density 
of 120 galaxies by square arcmin down to i + < 26.5. In the 
COSMOS photometric -redshift catalog, 10% of this surface cor- 
responds to masked areas because of bright stars that prevent 
multiband quality photometry with a deep photometry in the ex- 
tended bright star halos. The effective area is thus in fact 1.24 
deg 2 of unmasked region with a total number of 538000 simu- 
lated galaxies out to i + < 26.5 leading again to roughly a den- 
sity of 120 gal/arcmin 2 . Point sources such as stars and X ray 
sources (mostly dominated by an AGN) were also removed from 
the mock catalog. 

Based on a comparison between the photo-z and differ- 
ent spectroscopic sample like the zCOSMOS sample (bright 
down to i*. < 22.5 with 4148 galaxies and faint down to 



'med < 24 with 148 galaxies) and also the MIPS infrared se- 
lected sample bright and faint with 317 galaxies, Figure 7 
and 8 of Ilbert et al. (2009), the COSMOS photo-z accuracy 
is cr Az / (1+Zs) = 0.007 at i + < 22.5 with a catastrophic rate be- 
low 1% see Figure 6 of Ilbert et al. (2009). At fainter magnitude 
and z < 1 .25, the estimated accuracy is cr Az = 0.02, 0.04, 0.07 at 
i+ ~ 24, i + ~ 25, i + ~ 25.5, respectively. The accuracy is de- 
graded at ; + > 25.5. The deep NIR and IRAC coverage enables 
the photo-z to be extended to z ~ 2 albeit with a lower accuracy 
(o"Aj/(i+ 2s ) = 0.06 at ;'+ B ~ 24) see Ilbert et al. (2009) for more 
details. 

Despite the lower photo-z accuracy at ; + > 25.5 andz > 1.25 
and possible bias of faint AGN contribution, we used the full 
COSMOS catalog. Indeed, the photo-z accuracy is not crucial 
for the simulation. The COSMOS catalog is only used to obtain 
a representative population of galaxies in term of density and 
mix of galaxy types. Since the predicted apparent magnitudes 
are calculated from the best-fit templates, the photo-z accuracy 
has no impact on our ability to link the simulated redshift to the 
predicted colors. The only risk of including lower quality photo- 
z's is to degrade slightly the catalog representativity, possibly 
biasing the redshift distribution at z > 1.25. 

2.2.2. The mock catalog construction 

The principle of our simulation is to convert the observed prop- 
erties of each COSMOS galaxy into simulated properties that 
can then be viewed using any possible instrument configuration. 
A photo-z and a best-fit template (including possible additional 
extinction) are associated with each galaxy of the COSMOS cat- 
alog. 

The first step is to integrate the best-fit template (in the ob- 
server frame) through the instrument filter transmission curves 
to produce simulated magnitudes in the instrument filter set. 

The second step is to apply random errors to the simulated 
magnitudes, based on the magnitude-error relations established 
in each filter (see Section 2.1.4). 

Importantly, all the COSMOS measured properties are prop- 
agated to the simulated galaxies, for instance, the galaxy half- 
light radius as measured on the ACS images by Leauthaud et al. 
(2007). 

This approach presents the following advantages: 

- the simulated mix of galaxy populations is, by construction, 
representative of a real galaxy survey, 

- additional quantities measured in COSMOS (like the galaxy 
size, UV luminosity, morphology, stellar masses, correlation 
in position) can be easily propagated to the simulated cata- 
log. 

The COSMOS mock catalog is limited to the range of magni- 
tude space where the COSMOS imaging is complete (i+ B ~ 26.2 
for a 5cr detection, see Capak et al. (2007) and Capak et al. 
(2009) in prep. 

2.2.3. Simulating emission lines 

For each galaxy of the COSMOS mock catalog we have asso- 
ciated emission line fluxes. This feature is useful to predict the 
size and the depth of a spectroscopic redshift sample. We mod- 
eled the emission line fluxes (Lya, [Oil], H/?, [OIII] and Ha-) of 
each galaxy as explained below. 

We used the method described in Section 3.2 of Ilbert et al. 
(2009). Using the Kennicutt (1998) calibration, we first esti- 
mated the star formation rate (SFR) from the dust-corrected 



UV rest-frame luminosity already measured for each COSMOS 
galaxy. The SFR can then be translated to an [Oil] emission 
line flux using another calibration from Kennicutt (1998). We 
checked that the relation found between the [Oil] fluxes and the 
UV luminosity is in good agreement with the VVDS data (see 
Fig.3 of Ilbert et al. (2009)) and still valid for different galaxy 
populations. For the other emission lines, we adopted intrin- 
sic, unextincted flux ratios of [OIII]/[OII] = 0.36; H0/[OII] = 
0.28; Ha/[OH] = 1.77 and Lya/[OII] = 2 (McCall et al. (1985), 
Moustakas et al. (2006), Mouhcine et al. (2005), Kennicutt 
(1998)). The approach of determining the Lya line flux through 
its ratio with the Oil emission line flux is perhaps not a good de- 
scription but since the Lya line becomes visible up to z ~ 3 for 
optical-NIR wavelength range it will not have a big impact on the 
studies about future optical-NIR dark energy surveys. Finally, 
we reduce each galaxy's line flux using the best-fit dust attenu- 
ation found with the template fitting procedure in the COSMOS 
photo-z catalog. 

The same procedure can be applied to the GLFC since we 
can calculate the UV absolute luminosity the same way as for 
the CMC. 

The COSMOS mock Catalog (CMC) has the advantage over 
the GLFC that it better preserves the relations between galaxy 
size and color (and presumably type and redshift). The CMC 
may also more accurately reproduce color distributions, since 
the population has not been reduced to three galaxy types as in 
the GLFC. The CMC is, however, limited by construction to the 
range of magnitude space where the COSMOS imaging is com- 
plete, whereas the GLFC can be extrapolated to fainter galaxies. 
Our validation tests below will compare the differences of these 
two simulation approaches. It is important to stress that all the 
following galaxy number densities correspond to mask corrected 
areas. Hence for real surveys, those numbers would have to be 
reduced by a factor of about ten % as observed in COSMOS 
field. 

3. Validation 

The aim of these mock catalogs is to predict the performance 
of future surveys such like JDEM and EUCLID. The depth of 
the foreseen surveys may be significantly fainter than the deep- 
est existing spectroscopic and NIR imaging wide field surveys, 
so a comprehensive observational validation of the catalog is 
not yet possible especially in terms of NIR photometry and red- 
shift distribution. We should, however, prove that the GLFC and 
CMC are consistent with existing data. For this validation, we 
will compare the galaxy counts, color distributions, redshift dis- 
tributions, and emission-line distributions to a selection of the 
deepest available relevant survey data. The comparison data are 
taken from: 

- The HST Ultra-Deep Field (UDF) catalog (Coe et al. 2006) 
2 and included references covers 11.97 arcmin 2 in the 4 
HST/ACS filters F435W, F606W, F775W and F850LP, re- 
ferred to as B, V, i, and z bands, respectively and 5.76 
arcmin 2 for the NICMOS filters Fl 10W and F160W respec- 
tively the J and H-band. The UDF detects objects at 10cr for 
z < 28.43, i < 29.01 and J < 28.3, significantly fainter than 
the expected depth of any of the aforementioned proposed 
surveys. Due to the small area, the UDF is more sensitive to 
the cosmic variance, but it is useful for comparisons at faint 
magnitude. 

2 http ://adcam. pha. j hu.edu/- coe/UDF/ 



- The GOODS (Great Observatories Origins Deep surveys) 
Giavalisco et al. (2004) vl.l survey catalog 3 . The GOODS 
observations are split into northern and southern fields. Each 
field covers 160 arcmin 2 in the same BViz filters as the UDF, 
and is 90% complete at z ~ 26. 

- The GOODS-MUSIC (MUltiwavelength Southern Infrared 
Catalog) sample Grazian et al. (2006) has been constructed 
with public data of the GOODS-S field, NIR Spitzer data 
from IRAC instrument (3.6, 4.5, 5.8 and 8.0 ji m) and 
U-band data from the 2.2ESO and VLT-VIMOS covering 
140 arcmin 2 . This catalog is both z and Ks-selected and is 
90% complete for K s < 23.8 and z < 26. 

- The VVDS-DEEP first epoch public release 4 includes pho- 
tometric data in BVRI VIRMOS-VLT bands over 0.49 deg 2 
McCracken et al. (2003); a spectroscopic survey of targets 
with Iab < 24 (Le Fevre et al. 2005) with a sampling rate of 
0.2 and a mean redshift of 0.86. 

- The VVDS-DEEP NIR J and K s photometry (Iovino et al. 
2005) covers 170 arcmin 2 and is complete for Ks < 22.5. 
This sample contains the BVri VIRMOS-VLT band and 
the JKs ESO/NTT (New Techonology Telescope) using the 
SOFI Near Infrared imaging camera. 



A cut in the size-magnitude plane is applied to each dataset in or- 
der to remove stellar contamination, in case this has not already 
been done for published catalogs. 

For each of the above listed surveys, both a CMC and a GLFC 
are constructed as described above, projecting both mock cata- 
log onto the real surveys' filter sets. 



3.1. Galaxy Counts 

We compare the simulated galaxy counts (dN/dm) to those in the 
real catalogs listed above. The SEDs of each simulated galaxy is 
integrated over the bandpass of each real catalog. 

In Figure 3 we compare the differential galaxy counts in the 
B, V, i, and z bands of the UDF and GOODS to those synthesized 
from the GLFC and CMC simulations. The UDF counts in the 
NIR are also compared to the simulations. Excellent agreement 
is seen between all sources for 21 < m < 26, with some ten- 
dency for the UDF counts to be lower than other surveys. We 
attribute this to cosmic variance because of the very small UDF 
survey area. The difference between the simulated catalogs and 
the GOODS South is generally less than the difference between 
GOODS North and South (« 15%). Note that the CMC becomes 
incomplete for m > 26 (m > 25 in the 1 .6 fim band), so will un- 
derestimate the galaxy yield in surveys deeper than these limits. 

We also compare in Figure 3 the simulated Ks band counts 
to the GOODS-MUSIC and VVDS data. The tf s -band VVDS 
data of Iovino et al. (2005) are 90% complete at Iab = 25.5. 
We see a very good agreement between the VVDS, GOODS- 
MUSIC and simulated catalogs at Ks < 20 and some discrep- 
ancies at fainter magnitudes. However, we measure a mean less 
than 17.9% difference between VVDS Iovino and CMC, 14.4% 
difference between VVDS Iovino and GLFC and 17.4% between 
GOODS-MUSIC and VVDS Iovino within the observation lim- 
its of these 2 surveys. We conclude that both simulated catalogs 
are well reproducing the Ks counts. 




20 25 20 25 20 25 



AB magnitude 

Fig. 3. Differential galaxy counts for F435W (B band), F775W (i band), 
F850LP (z band), F160W (H band) and the Ks band. The blue solid line 
is the COSMOS mock catalog (CMC) and the black dot-dashed line is 
the Le Phare simulation based on the GOODS LF (GLFC). The real 
observations are: the UDF (red dotted); GOODS North (long-dashed 
green) and South (dashed magenta). Note the excellent agreement be- 
tween all simulations and real data where each is complete. 
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Fig. 4. Color histogram of F435W (B Band)-F8501p (z Band). The blue 
solid line is the simulation based on the COSMOS catalog (CMC) in 
GOODS filters and the black dotted line is the Le Phare simulation 
based on the GOODS LF and filters (GLFC). The real observations 
are: the UDF (dot-dashed red); GOODS North (long-dashed green) and 
South (dashed magenta). The indicated cut in S/N and magnitude are 
applied to F850LP (z Band) for each catalog. 



3.2. Colors 

Figure 4 compares the B — z color distribution of the simulated 
catalogs to those of GOODS and UDF galaxies. The magnitude 
cuts in the B and z bands are taken following the COSMOS com- 
pletness in each of these bands. There is a good agreement in 
these optical wavelengths. The simulated catalogs have median 
and mean B — z colors in agreement at a few percent with the 
UDF and GOODS catalogs (see table 3). 

The UDF seems relatively deficient in the reddest galaxies, 
again perhaps a manifestation of cosmic variance, which is most 
severe for the highly-clustered red-sequence galaxies. This is an 
expected result and is not an issue with the catalogues since 
the UDF may be under-dense due to its small survey area. 



Table 3. Mean and Median of the B — z color distributions with the 
magnitude cuts of Figure 4 corresponding to the completness of the 
CMC. 



Catalogs 


Mean 


Median 


survey area 


mag limit 


CMC(GOODS) 


1.24 


1.14 


1.24 deg 2 


I -26 


GLFC(GOODS) 


1.2 


1.05 


0.1 deg 2 


R-31 


GOODS-S 


1.26 


1.1 


160 arcmin 2 


z - 26 


GOODS-N 


1.3 


1.13 


160 arcmin 2 


z - 26 


UDF 


1.16 


1.06 


1 1.97 arcmin 2 


z - 28.43 



Figure 5 compares simulated B - Ks colors to those in the 
GOODS-MUSIC and VVDS-DEEP catalog. The color distribu- 
tions in NIR agree well inside the completeness limits imposed 
by the different surveys, the CMC z < 25.5 and B < 26.5 and the 
GOODS-MUSIC survey Ks < 22.5. The GOODS-MUSIC data 
have lower B-Ks counts which is explained by the number count 
difference at Ks > 20 in the Ks-band between both simulated 
catalogs and the Grazian count see Figure 3. However, the mean 
and median are in good agreement with this latter. The agree- 
ment is much better in Figure 5b between the CMC, GLFC and 
VVDS-DEEP data. 

3.3. Redshifts 

Comparison of the simulated redshift distributions to real data is 
likely less accurate than colors or magnitudes, since real redshift 
surveys are much shallower and have significant incompleteness. 
Nonetheless we plot in Figure 6 the median redshift vs /-band 
magnitude for the CMC and GLFC catalogs and the VVDS sur- 
vey. 

The redshift quartiles of the CMC agree wery well with the 
VVDS spectroscopic redshift distribution to the / = 24 limit 
of the latter. The GLFC seems to have a lower median redshift, 
probably due to the GOODS LF used to produce the catalog. 
The mean difference of the median redshift is 0.02 and respec- 
tively 0.07 for the CMC and GLFC catalogs. We conclude not 
surprisingly that the CMC is probably a better representation of 
the magnitude-redshift distribution, and is as accurate as current 
data can validate. Figure 7 plots median B - i color vs redshift 
for the simulated catalogs CMC and GLFC and for the VVDS- 
DEEP. All catalogs have been restricted to / < 24, the domain 
of the VVDS-DEEP redshift survey. The colors agree to 0.1-0.2 
mag until z ~ 2. Above this redshift the VVDS survey is likely 
highly incomplete in this regime, even for blue galaxies, as no 

3 http://archive.stsci.edu/prepds/goods/ 

4 http://cencos.oamp.fr/ 
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Fig. 5. Color histogram of F435W (B Band)-Ks. The blue solid line is 
the COSMOS mock catalog (CMC) and the black dotted line is the Le 
Phare simulation based on the GOODS LF (GLFC). The observational 
data are the GOODS-MUSIC (top panel) catalog in dashed magenta 
and VVDS (bottom panel) catalog in dashed green. We choose to cut at 
22.5 AB mag in the Ks Band due to the area variability of the GOODS- 
MUSIC sample above this magnitude and the VVDS completness. 



strong emission lines are available in the VVDS spectral wave- 
length range and due to the / = 24 magnitude cut see Figure 6 
for the median redshift and quartiles of the distribution. 

We compare the redshift distributions Figure 8 for different 
magnitude cuts. 

The CMC and the VVDS-DEEP catalog show very good 
agreement. This figure shows that both mock catalogs are well 
reproducing the number density inside the VVDS volume : (0 < 
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Fig. 6. Median redshift distribution function of the magnitude i Band 
VVDS magnitude. The continuous lines are the median redshift and 
the dotted lines represents the quatiles. The blue line is the simulation 
based on the COSMOS catalog (CMC) in VVDS filters and the black 
line is the Le Phare simulation based on the GOODS LF in VVDS filters 
(GLFC). The real observation is the VVDS-DEEP catalog (green). The 
indicated cut in S/N is applied to I VVDS Band. We note the very good 
agreement between the median redshift of the COSMOS and VVDS- 
DEEP catalogs. 



Z < 1.5 and / < 24) and agree at higher magnitude and redshift. 
The GLFC is based on the GOODS luminosity function Dahlen 
et al. (2005) which has equivalent depth to the VVDS galaxy 
sample. However, in looking at Figure 6, the VVDS magnitude- 
redshift distribution seems in better agreement with the CMC 
than the GLFC. 



3.4. Emission-Line Strength 

Future dark energy surveys need large spectroscopic redshift 
samples to calibrating photometric redshifts, measuring ac- 
curately the BAO and for other probes using spectroscopic 
samples of galaxies. Thus, this is crucial to have realistic 
emission lines allowing to make predictions on the success 
rate, depth, and size of spectroscopic samples that we will 
need to answer fundamental questions which justify the fu- 
ture surveys we are planning. Although, the CMC does not 
reproduce absorption lines, it simulates emissions lines for 
all galaxies in the catalogue which allows a first estimation 
of spectroscopic survey capacities. The validation of the emis- 
sion line strength can be found in Figure 3 of Ilbert et al. (2009). 
This figure shows the relation between the Oil flux and the rest- 
frame UV luminosity predicted by Kennicutt (1998) : 



log[OII] = -OAMuv + 10.57 



DM(z) 
2.5 



(5) 



It shows very good agreement between VVDS data from 
Lamareille et al. (2008) and the simulated emission lines 
strength extrapolated from the photometric redshift best fit tem- 




Fig. 7. Median color of B-i VVDS band function of the redshift. The 
blue solid line is the simulation based on the COSMOS catalog (CMC) 
in VVDS filters and the black dotted line is the Le Phare simulation 
based on the GOODS LF in VVDS filters (GLFC). The real observation 
is the VVDS-DEEP catalog in dahed green). The indicated cut in S/N 
is applied to i Band VVDS and the i Band magnitude cut is the com- 
pletness limits of the VVDS-DEEP catalog. The error bars are drawn 
from the root of number of objects. There is a good agreement between 
the VVDS-DEEP and COSMOS catalog as well as the GLFC until a 
redshift around 1.8. 

plate between 0.4 < z < 1.4 where the Oil emission line can be 
measured. 

The histograms shown Figure 9 are the emission line ratios 
of the CMC compared with those of VVDS. The attenuation 
causes a spread on the emission line ratios but we had to add 
a gaussian dispersion to fit the VVDS emission line spread. 
Knowing that r is a number randomly drawn from a gaussian 
law (ji, cr 2 ) = (0, 1), we define the new fluxes as : 



Ha = Ha(l + \r/4\) 

Hp = Hp{\ + |r/2[) 

[OIII] 4959 = [<9///] 4 959(l + |1.2r|) 

[Oiir\ 5001 = [0///W1 + M) 



(6) 
(7) 
(8) 
(9) 



The spectral wavelength range of VVDS spans from 0.55 to 
0.94/mi (Le Fevre et al. 2005) making the [Oil] line visible from 
redshift 0.5 to 1.5 and the Ha line visible from redshift to 0.5. 
Thus we choose to evaluate the validity of the Ha fluxes using 
the ratio Ha /Hp, known as the Balmer decrement and having a 
value ~ 2.9. All CMC ratio spread fits well those of VVDS. 

For a further validation of the CMC emission lines we 
choose to reproduce the spectroscopic success rate (SSR) of the 
VVDS survey. To realise the CMC(VVDS) SSR we have ex- 
trapolated the flux sensitivity of the VVDS survey in fonction of 
wavelength for several signal-to-noise ratio based on the [Oil] 
emission lines. Using these sensitivities and the CMC emission 
line catalog we derive the simulated SSR for the VVDS survey 
in order to validate the emission line model of the CMC cat- 
alog (Figures 10 and 11) . Figures 10 shows the SSR for the 



Fig. 8. Redshift distribution. The blue solid line is the simulation based 
on the COSMOS catalog (CMC) in VVDS filters and the black dotted 
line is the Le Phare simulation based on the GOODS LF in VVDS filters 
(GLFC). The real observation is the VVDS-DEEP catalog in dashed 
green. The indicated cut in S/N is applied to I VVDS Band and the 
i Band magnitude cut is the completness limits of the VVDS-DEEP 
catalog. There is a very good agreement beetween the VVDS-DEEP, 
COSMOS and GLFC redshift distribution. 
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Fig. 9. Emission line ratios. The magenta histograms are the VVDS sur- 
vey and the black histograms the CMC. The red dotted line is the value 
of the theoretical emission line ratios. 



emission lines of the CMC catalog compared to those of the 
VVDS in fonction of redshift (top panel) and magnitude (bot- 
tom panel). The strong emission lines, Ha and Oil, are in very 
good agreement with the VVDS both in magnitude and redshift 
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Fig. 10. Comparison of the lines Spectroscopic Success Rate (SSR) of 
the VVDS survey with the simulated lines VVDS SSR using the CMC 
emission lines. Top panel shows the SSR of emission lines as a function 
of redshift, dashed lines for the VVDS survey and solid lines for the 
CMC. The blue lines are for the Oil lines, red for Ho-, cyan for H/?, gold 
for OHIa at 4959A and purple for OIII at 5007A. Bottom panel shows 
the same as the top panel as a function of magnitude. 
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Fig. 11. Comparison of the Spectroscopic Success Rate (SSR) of the 
VVDS survey with the simulated VVDS SSR using the CMC emission 
lines. Top panel shows the VVDS SSR for secure redshift (brown lines) 
and very secure redshift (orange line) as a function of redshift compared 
to the CMC emission lines 3<r VVDS flux detection (grey lines) and 2 
lines at 5cr detection or Oil at 5<r detection with V < 24. Bottom panel 
shows the same as the bottom panel as a function of magnitude. 



space. However weaker emission lines have a discrepancy from 
10 to 20 % with the VVDS emission lines at low magnitudes 
especially for the [OIII] lines at 4959A and 5007A but it agrees 
very well at magitudes I > 21. Figure 11 compare the overall 
SSR predictions of the CMC with the VVDS secure and very 
secure redshifts. We see that the VVDS secure redshifts success 
rate are very close to the 3sig detection line of the CMC cat- 
alog. In the same way, the VVDS very secure redshift success 
rate agrees very well with the 2 lines detection at 5cr including 
galaxies with V < 24 which corresponds to a detection of the 
continuum in the VVDS spectra with a S /N > 10 in the blue 
part of the spectrum (which is free of strong sky emission lines). 
The differences mainly comes from the redshifts effectively ob- 
tained using absorption lines for which the shape of the contin- 
uum altogether does not match exactly with our ad-hoc V-band 
magnitude criterion. Although, the CMC does not simulate the 
absorption lines, we can say that the CMC emission line shows a 
good agreement both in redshift and magnitude with the VVDS 
spectroscopic survey. These results makes us confident in using 
the CMC catalog as a tool to predict the SSR of future wide field 
spectroscopic surveys. 



4. Discussion 

Different cosmological tests have been proposed to probe the 
geometry and growth of structures in order to shade new 
light on the nature of dark energy. The best approach is cer- 
tainly to combine different probes to reduce the errors and 
better understand the systematics. However, each probe has 
its own requirement and it is a technical and scientific chal- 
lenge to design a telescope optimised for more than one cos- 
mological probe. Nonetheless, the goal of future dark energy 
programs should find a survey strategies that will lead to the 
best combination efficiency of different probes. 

Having a realistic description of galaxy properties in our 
Universe is key to properly forecast what future deep and 
wide surveys can achieve for a given survey configuration. 
In this section, we start a discussion about WL requirements 
with the shape measurement and the photo-z calibration sur- 
vey (PZCS) which is required in order to use weak lensing 
tomography to constrain Dark Energy parameters that we 
aim to combine with BAO estimations of the Dark Energy 
parameters. 

Thus, we will discuss two important aspects of these fu- 
ture dark energy mission: (i) the impact of galaxy size in 
terms of shape measurement for weak lensing depending on 
the PSF size (ii) the different typical WL and BAO survey 
configuration we have tested using our mock catalogues and 
( Hi) their expected spectroscopic success rate. 

Note that we are not conducting here any optimisation 
of such space mission and its survey strategy. We are just 
putting in place some of the needed tool necessary for such 
optimisation. 



4.1. Galaxy-sizes 

A comparison of the simulated galaxy-size distribution to obser- 
vational data is not particularly useful. Indeed the simulated cat- 
alogs agree, by construction, with the size distribution measured 
in the COSMOS ACS imaging survey Koekemoer et al. (2007), 
Leauthaud et al. (2007). No other survey capable of resolving 
faint galaxies approaches the size of the COSMOS/ACS data. In 
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Fig. 12. Cumulated Half-light radius distibution by bins of magnitude. 
The brown area is the PSF size of ground-based telescope, the grey 
area is an estimation of the PSF size of future dark energy mission and 
the black dashed line is the measured PSF size of the COSMOS space 
survey (Leauthaud et al. 2007). 



Figure 12 we show the cumulated half-light radius distribution 
for the COSMOS data. 

The brown area corresponds to the typical PSF size of 
ground-based telescopes. It shows that we can only resolve and 
measure galaxy shapes (those galaxies with a size larger than 
the PSF size) for about 15gal/arcmin 2 (or up to 20gal/arcmin 2 in 
case of excellent seeing) as soon as a depth of / ~ 25 is reached. 
Going deeper than I = 25 will not help to raise this number den- 
sity as fainter galaxies are smaller than the PSF, thus making a 
shape measurement impossible. 

The grey area corresponds to the PSF size expected with fu- 
ture space Dark Energy mission; the exact value of the PSF size 
depends on the telescope diameter and the chosen pixel scale 
(here we assume a telescope diameter of ~ 1.5-1.8 meter). 
Interestingly, with this small PSF size, the number density of re- 
solved galaxies does increase as a function of the depth of the 
survey. Beyond / = 26 the increase in number density is how- 
ever smaller, suggesting that there is a limited gain to go much 
deeper than / = 26 except may be for telescope designs with the 
smallest PSF size. With a depth of / ~ 25.5-26 the number den- 
sity of resolved galaxies in the foreseen space surveys is about 
3x larger than what ground-survey can achieve. 

Contrary to the ground case, the performance of a space sur- 
vey in terms of galaxy number density critically depends on the 
caracteristic of the survey in terms of depth and image resolu- 
tion. The black dashed vertical line correspond to the PSF size 
of the ACS/HST camera as measured in the COSMOS survey 
Leauthaud et al. (2007). The COSMOS survey reach a num- 
ber density of resolved galaxies of ~ 1 lOgal/arcmin 2 down to 
/ ~ 26.5. It is important to recognized however that not all these 
galaxies could be used in the COSMOS 3D weak lensing analy- 
sis as explained in Leauthaud et al (2007). For a fraction of these 
galaxies there was no counterpart in the ground-based catalogue 
because of masking of the data. For the remaining galaxies, it 



was not always possible to determine securely the shape or the 
redshift thus decreasing further the number of usable galaxies to 
roughly 40gal/arcmin 2 . 

Cosmological weak lensing survey aim to maximize the 
number of resolved galaxies that can be used for weak lensing 
tomography. As shown by Amara & Refregier (2007) it is more 
efficient to conduct a wider survey than a deeper survey as the 
galaxy number increase is larger for a given exposure time by 
going wide than by going deep. However the sky brightness and 
the variable star density is limiting this prescription to typically 
10000 to 15000 deg 2 , a quarter of the sky area. 

4.2. BAO and WL spectroscopic survey requirements 

We investigate here three types of spectroscopic redshift 
survey (see Figure 13 and Table 4) whose characteristics 
correspond to typical requirements for the BAO and WL 
probes. 

(i) BAO aims to measure the baryon accoustic oscilla- 
tion peak of the 2-pt correlation function as a measure of 
a standard ruler. It requires a large survey area to reduce 
the statistical noise (Glazebrook & Blake 2005; Blake & 
Glazebrook 2003) and an accurate redshift for each galaxy, 
typically using spectroscopic techniques to reach a higher ac- 
curacy. There are studies about using photometric redshift 
instead of spectroscopic redshift but it then needs a larger 
area (typically a few times larger in area) due to the precision 
loss of the scales along the line of sight to reach the same dark 
energy figure of merit (FOM) Glazebrook & Blake (2005). 
Following this last paper, we develop a WIDE near infrared 
(1.0 to 1.7 micron) spectroscopic survey reaching a 3<x sensi- 
tivity of lxl0~ 16 ergs cm~ 2 s~ l at 1.2/im (this can be achieved 
with an efficient slitless spectrograph using a 1.5m telescope, 
0.28"/pixel, R~500 and an exposure time of 1200 sec) and 
covering a large area on the sky to maximise the dark energy 
FOM. 

(ii) The weak-lensing tomographic analysis is likely the 
most efficient lensing method to estimate the Dark energy 
parameters (Massey et al. 2007), (Amara & Refregier 2007). 
This method probes the growth of structures but needs to 
have accurate redshifts to place the background sources at 
their correct location. To achieve this measurement, it is es- 
sential to determine with the best accuracy (and minimal bi- 
ases) the photometric redshift (photo-z) of all the galaxies to 
be used in the weak lensing tomography. Ma & Bernstein 
(2008) have shown that a high accuracy photo-z is needed 
to avoid any bias on the Dark Energy parameter estimation. 
The only way to reach high accuracy photo-z is to plan a 
spectroscopic redshift survey to calibrate the photo-z tem- 
plates similarly as shown in Ilbert et al. (2006). Ideally, the 
photo-z calibration survey (PZCS) would reach the same 
magnitude and redshift depth as the WL photometric sur- 
vey. However, this goal will likely be difficult to achieve and 
the CMC may help to foresee the best spectroscopic surveys 
through its emission line catalog usefully complementing the 
photometric catalog. Thus, we design a DEEP-visible-NIR 
survey (0.6 to 1.7 microns) for weak-lensing reaching a 3<x 
flux sensitivity of 5.10 18 ergs crrr 2 s~ x at 1.2/Km (this can be 
achieved with an efficient slitless spectrograph using a 1.5m 
telescope, 0.28"/pixel, R~250 and an exposure time of 240 
ksec~67 hours) in order to have a large number of spectro- 
scopic redshift allowing a full calibration of the photometric 
redshift for a wide range in redshift and magnitude. 



( Hi) We also desgin a DEEP-NIR survey having the same 
caracteritics that the DEEP-visible-NIR survey but covering 
only the NIR part (1.0 to 1.7 micron; focussing on the low IR 
background of space observation). Our goal here is to evalu- 
ate the importance of having a spectroscopic contribution in 
the visible wavelength to reach a high completness. 

4.3. SSR prediction 

In the following sections we will give the basics of photo-z cal- 
ibrations survey studies based on spectroscopic success rate as 
a function of redshift and magnitude. A more thorough analysis 
will be developed in more details in a forthcoming paper (Jouvel 
et al 2009 in prep). 
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Fig. 13. Flux sensitivities for VVDS-DEEP survey (~10ksec exposure 
on a 8m ground-based telescope with a spectroscopic resolution of 
R=250) at 3cr in green and 5<x in gold as a function of wavelength 
compared to space sensitivities for ( i) a WIDE space near infrared sur- 
vey reaching a 3<r flux sensitivity of l(T 16 erg/s/cm 2 at 1.2yum [The cyan 
curve for the WIDE NIR survey sensitivity at 3cr.]; (ii) a DEEP-visible- 
NIR survey reaching a 3tr flux sensitivity of 5.1CT 18 erg/s/cm 2 at 1.2yum. 
[The black curve for the DEEP-visible-NIR space survey sensitivity at 
3cr and the red curve at 5cr.] We will also consider the DEEP-NIR sur- 
vey which is identical to the DEEP-visible-NIR but limited to the NIR 
domain [the purple area]. 



4.3.1 . SSR for the WIDE survey 

The wide survey would aim to cover a large fraction of the sky 
(10,000 deg 2 or more) in order to probe the large scale distri- 
bution of galaxies aiming in particular to measure the baryon 
acoustic oscillation with a great accuracy in the redshift range 
0.5 < z < 1.5 (or up to z ~ 2 providing the telescope and 
instrument is sensitive up to 2fim). Our SSR prediction, shows 
that such survey could easily measure the redshift of more than 
4 galaxy/armin 2 hence providing the redshift measurement of 
more than ~100 million galaxies for a survey covering 10,000 
deg 2 . The redshift identification is essentially based on the Ha 
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Fig. 14. WIDE NIR space survey SSR Top panel : SSR as a function of 
redshift limited to galaxies with H < 23.5. The long-dashed red curve is 
the SSR of the Ha 3cr detection line and the long-dashed blue curve the 
SSR of the [Oil] 3<t detection line. Bottom panel : SSR as a function 
of magnitude. Both panels : the orange lines shows the SSR using a 1 
at 5cr criterion, the grey curve using a 1 line at 3<x cireterion, and the 
purple using a continuum S/N>10 criterion. The solid curves are a per- 
centage with the total number of objects and the dashed for a cumulative 
number of objects by arcmin 2 . The purple curve shows the SSR. 



line detection thus essentially targeting star-forming galaxies 
with H < 22.5. The SSR clearly shows that such survey is 
far from being complete as the spectroscopic success rate is 
generally below 40% for any magnitude and redshift (for a 
5<x line detection). Indeed, there is no faint galaxies spec- 
troscopically found, especially in the redshift range which 
has the strongest need of photometric redshift calibration : 
< z < 0.5 and z > 1.5. We also note the severe glalaxy 
type incompletness even from early to late type galaxies, see 
Table 4. Thus, such wide survey can not be used to calibrate 
properly any photometric redshift catalogue. 

4.3.2. SSR for the DEEP survey 

Figures 15 and 16 show the SSR prediction for future dark en- 
ergy surveys which are planing to do infrared only (DEEP-NIR 
survey), and visible +infrared spectroscopy (DEEP-visible-NIR 
survey) from space. Figure 13 show the grism flux sensitivity 
we use in our analysis. 

We assume here that the visible spectroscopy spans 0.6-1/mi 
see Figure 13. Because of the limited wavelength coverage, the 
Ha is the only strong emission line visible for galaxies at low 
redshift (z<0.5). This explains that the detection criterion based 
on 2 lines, does not cover the low redshift range as shown in the 
top panel of Figure 15. This is also seen in the bottom panel 
of Figure 15 where there is a lower SSR for bright galaxies. 
However, for these low redshift (z < 0.5) galaxies, about 30% 
have a strong continuum with S/N>10. For these bright galaxies, 
we should be able to accurately measure a redshift using either 




Fig. 15. DEEP-visible-NIR space survey SSR. Top panel : SSR as a 
function of redshift. The long-dashed red curve is the SSR of the Ho- 
detection line and the long-dashed blue curve the SSR of the [Oil] de- 
tection line. Bottom panel : SSR as a function of magnitude. Both pan- 
els : the orange lines shows the SSR using a 2 lines with at least 1 at 5<x 
criterion, the grey curve using a 1 line at 3cr cireterion, and the purple 
using a continuum S/N>10 criterion. The solid curves are a percentage 
with the total number of objects and the dashed for a cumulative number 
of objects by arcmin 2 . The purple curve shows the SSR. 



Fig. 16. DEEP-NIR only space survey SSR. Top panel : SSR as a func- 
tion of redshift. The long-dashed red curve is the SSR of the Ha detec- 
tion line and the long-dashed blue curve the SSR of the [Oil] detection 
line. Bottom panel : SSR as a function of magnitude. Both panels : the 
orange lines shows the SSR using a 2 lines with at least 1 at 5cr crite- 
rion, the grey curve using a 1 line at 3<x cireterion, and the purple using 
a continuum S/N>10 criterion. The solid curves are a percentage with 
the total number of objects and the dashed for a cumulative number of 
objects by arcmin 2 . The purple curve shows the SSR. 



the Ha line detection, absorption lines and the shape of the con- 
tinuum which will be useful for photometric redshift calibration. 

If only the NIR spectroscopy is conducted from space, the 
survey will only reach ~ 35galaxy per arcmin 2 and a maximal 
SSR of at best 40% for / ~ 24. Using DEEP-NIR spectroscopic 
survey would help to calibrate spectra at 0.8 < z < 2.5 but would 
be less efficient below z < 0.8. There will be a strong need to 
conduct the visible spectroscopy to achieve a complete census 
of the redshift distribution. 

Using the DEEP-visible-NIR survey, we should be able to 
measure a very secure redshift (2 lines detected at more than 
3cr with at least 1 emission line at 5cr) for about 60 galaxy per 
arcmin 2 to / < 26 and z ^ 2.5 with a SSR reaching 80% at best 
for / ~ 24. With this density of sources, a survey of 1 square 
degree will provide 220000 spectra. Note however, that these 
numbers do not take into account the crowding of galaxy spec- 
tra which is likely a strong limiting factor above / ~ 24. There 
are ways to minimize the impact of crowding, for example by 
conducting the slit-less spectroscopic survey using different ori- 
entations, or by using masks to block some of the light (thus 
limiting the sky background and reducing the number of over- 
lapping spectra). These alternatives will be discussed in a forth- 
coming paper Zoubian et al. 2009 in prep. . Note also that the 
visible part of the spectroscopy might be conducted with a ded- 
icated ground-based spectrograph with high-multiplexing (this 
can probably be more efficient specially for A < 0.75//m where 
the sky emission lines are less numerous than in the redder part 
of the spectrum). 



Table 4. Characteristics of the three surveys discussed in the text, as- 
suming an efficient 1.5m space telescope with an obscuration of 0.6m 
and a total telescope throughput of 92% using a survey efficiency of 
75% . The %[lline3cr] Ell represent the percentage of Elliptical galax- 
ies spectroscopically found using a criterium of 1 line detected at 3cr 
compared to the total population of Elliptical galaxies. With the same 
criterium, we define the percentage of Sac for early spiral galaxies, Sdm 
for late spiral galaxies, and SB for starburst galaxies. 
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< z < 3.5 


0.5 < z < 3.5 


0.5 < z < 1 


rriian [lline3cr] 


I~ 27 


I~ 26.5 


H~22 


%[lline3o-] Ell 


40 


27 





%[lline3cr] Sac 


60 


51 


1 


%[lline3<x] Sdm 


77 


63 


10 


%[lline3cr] SB 


95 


74 


45 



5. Conclusion 

We have produced simulated deep galaxy catalogs by two dif- 
ferent techniques: one starts with redshift-dependent Schechter 



luminosity functions for three galaxy types for < z < 6, ex- 
trapolating the LFs derived from GOODS data by Dahlen et al. 
(2005) (the GLFC). The other has galaxies with redshifts and 
SEDs taken as the best photo-z fits to multiwavelength obser- 
vations of the COSMOS field (the CMC). Both adopt galaxy 
size distributions from the COSMOS HST imaging and we as- 
sign emission line strengths using a recipe in agreement with the 
VVDS-DEEP data. 

Both simulated catalogs do an excellent job of reproduc- 
ing the dN/dm data (< 20% discrepancies) and color distribu- 
tions (0.1-0.2 mag median color discrepancies) of galaxies ob- 
served in bands from 0.4 to 2.2 //m using a comparison with 
the GOODS and UDF surveys. The CMC provides an excellent 
match to the redshift-magnitude and redshift-color distributions 
for / < 24 galaxies in the VVDS spectroscopic redshift survey. 

These simulated catalogs thus pass all our validation tests 
for use in forecasting the galaxy "yields" of future visible/NIR 
imaging surveys, as long as we restrict our analysis to the / < 
25.5-26 galaxies for which the COSMOS HST imaging is highly 
complete. These simulated catalogs provide a conservative esti- 
mate of the yield for surveys that go deeper in the visible, or 
reach > 24 in the NIR, because they miss faint or red galaxies 
that do not make the /-band cut. 

The GLFC has the potential to forecast deeper surveys than 
the CMC, but only if we are ready to extrapolate the luminosity 
functions to fainter magnitudes, which is probably OK in the 
visible bands, but it may be more strongly limited in the near- 
infrared bands. 

In the CMC, the galaxy sizes are directly coming from the 
HST/ ACS measurement and give us the opportunity to evalu- 
ate the difference in terms of possible shape measurement for 
a ground-based and space based telescope. We showed that a 
ground based WL survey is limited to use only the resolved 
galaxy and thus it is basically independent of the survey depth 
for / > 25 (as most of / > 25 galaxies are smaller than the 
ground based seeing disk). For a space based survey the number 
of resolved galaxies is not only dependent of the PSF but also 
the depth of the survey. 

Moreover we have also used the CMC and the emission line 
model to compare the spectroscopic success rate observed by the 
VVDS and the one we can predicted based on the CMC line flux. 
In general we obtain a good agreement between the observed 
success rate and the CMC prediction. 

We have then used the emission line model to explore what 
will be the yield in terms of redshift measurement for 2 types 
of surveys: (i) a WIDE survey motivated by measuring BAO, 
and (ii) a deep survey motivated to calibrate the photometric- 
redshift. We found that the WIDE survey reaches a density 
of 4gal/arcmin 2 for galaxies H < 22.5 in a redshift range of 
0.5 < z < 1.5 mainly detected using the Ha emission line. 
Furthermore, in a weak-lensing perspective, a DEEP survey both 
visible and NIR is needed to conduct the photometric redshift 
calibration of galaxies (PZCS) to be used in a weak-lensing mea- 
surement covering the full magnitude and redshift range. Indeed 
such survey reaches a density of 60 gal/arcmin 2 for very secure 
redshift down to / ~ 25.5 with a high completness of ~80%. 
We note that the visible part of the spectroscopic survey can be 
done from the ground however above A ~ 0.75yum as we can 
see Figure 13 a space based survey is likely to be much more 
efficient than a ground based survey. 

We have thus demonstrated here how useful realistic simu- 
lated catalogue are for designing future DE space mission. We 
will investigate more precisely the details of the WL optimisa- 



tion survey strategy in a forthcoming paper (Jouvel et al. 2009, 
in prep. ). 
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